Context Window
60,000
Input tokens
Full-context input ≈ $0.0016
Model Cost Profile
Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: unknown
Canonical ID: meta-llama/llama-3.2-1b-instruct
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0270 / 1M tokens
Output: $0.2000 / 1M tokens
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Meta: Llama 3.2 1B Instruct is a 1B-parameter instruction-tuned model listed here with a 60,000-token context window, enough for multi-turn dialogue and analysis of long documents. Input is priced at $0.027 per million tokens and output at $0.20 per million tokens, which keeps costs low for data-intensive workloads. Typical use cases include customer support automation and content generation.
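As a rough illustration of how the listed input price relates to the context window, the sketch below computes the cost of a single prompt that fills all 60,000 tokens. The function and constant names are hypothetical, and the calculation assumes simple linear per-token billing with no minimum charge.

```python
from decimal import Decimal

# Figures taken from this page; names are illustrative only.
INPUT_PRICE_PER_M = Decimal("0.0270")  # USD per 1M prompt tokens
CONTEXT_WINDOW = 60_000                # tokens

def input_cost(tokens: int) -> Decimal:
    """Cost in USD of sending `tokens` prompt tokens (assumes linear billing)."""
    return INPUT_PRICE_PER_M * tokens / Decimal(1_000_000)

full_context_cost = input_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context_cost:.5f}")
```

At two decimal places this amount rounds to $0.00, so a summary figure shown at that precision will not reveal the actual cost.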
Max Output
—
Not specified
Input Price / 1M
$0.0270
Prompt tokens
Output Price / 1M
$0.2000
Completion tokens
Top Benchmark
20.0
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Meta: Llama 3.2 1B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0270
Current Output / 1M
$0.2000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0270 |
| Output (Completion) | $0.2000 |
Estimate monthly spend for Meta: Llama 3.2 1B Instruct based on your workload.
Estimated Monthly Cost
$3.08
25M input + 12M output tokens
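The monthly figure above can be reproduced from the per-million prices. The following is a minimal sketch, assuming linear billing and half-up rounding to the cent; the function name and the rounding mode are assumptions rather than anything documented on this page.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices from the table on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.0270")
OUTPUT_PRICE_PER_M = Decimal("0.2000")
ONE_MILLION = Decimal(1_000_000)

def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated monthly spend in USD, rounded half-up to the cent (assumed)."""
    raw = (INPUT_PRICE_PER_M * input_tokens
           + OUTPUT_PRICE_PER_M * output_tokens) / ONE_MILLION
    return raw.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

# Workload from the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(25_000_000, 12_000_000)
print(f"Estimated monthly cost: ${estimate}")
```

The input side contributes $0.675 and the output side $2.40, so output tokens dominate the bill for this workload even though there are fewer of them.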