Context Window
131,072
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp16
Canonical ID: meta-llama/llama-3.2-3b-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced Apr 25, 2026
Meta: Llama 3.2 3B Instruct is an instruction-tuned model, served over an API, for natural language understanding, text generation, and conversational use. Its 131,072-token context window can hold long documents and multi-part queries in a single request. This listing is free: input and output are both priced at $0.0000 per 1M tokens, so teams such as startups and research groups can evaluate it without per-token charges.
Max Output
—
Not specified
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Performance History
Current TPS
0.00
Current Latency
0 ms
Uptime
94.6%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Estimate monthly spend for Meta: Llama 3.2 3B Instruct (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
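The estimate above follows from simple per-million-token arithmetic. The sketch below is illustrative only: the function name `estimate_cost` and the constant names are hypothetical, and it assumes cost is linear in token count with no minimums, tiers, or rounding rules beyond what this page lists.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost for a token volume, given prices per 1M tokens.

    Hypothetical helper: assumes purely linear per-token pricing.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost

# Prices as listed on this page (USD per 1M tokens).
INPUT_PRICE = "0.0000"
OUTPUT_PRICE = "0.0000"

# Workload from the estimate: 25M input + 12M output tokens per month.
monthly = estimate_cost(25_000_000, 12_000_000, INPUT_PRICE, OUTPUT_PRICE)

# One request that fills the whole 131,072-token context window with input.
full_context = estimate_cost(131_072, 0, INPUT_PRICE, OUTPUT_PRICE)

print(f"Estimated monthly cost: ${monthly:.2f}")
print(f"Full-context input: ${full_context:.2f}")
```

With both listed prices at zero, every workload evaluates to $0.00. The same function can be reused with another listing's rates to compare costs.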