Context Window
8,192
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: meta-llama· Tokenizer: Llama3 · Instruct: llama3 · Quantization: bf16
Canonical ID: meta-llama/llama-3-8b-instruct
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0300
Output: $0.0400
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Meta: Llama 3 8B Instruct is designed for applications requiring nuanced instruction understanding, making it ideal for customer support automation and personalized content generation. With a context window of 8192 tokens, this model can effectively handle extensive dialogues, enhancing user interaction in complex scenarios. Teams utilizing this API can expect a cost of $0.03 per million input tokens and $0.04 per million output tokens, allowing for scalable budgeting based on usage.
Context Window
8,192
Input tokens
Full-context input ≈ $0.00
Max Output
16,384
Completion tokens
Input Price / 1M
$0.0300
Prompt tokens
Output Price / 1M
$0.0400
Completion tokens
Top Benchmark
47.6
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Meta: Llama 3 8B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0300
Current Output / 1M
$0.0400
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0300 |
| Output (Completion) | $0.0400 |
Estimate monthly spend for Meta: Llama 3 8B Instruct based on your workload.
Estimated Monthly Cost
$1.23
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.