Context Window
131,072
Input tokens
Full-context input ≈ $0.05
Model Cost Profile
Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: meta-llama/llama-3.1-70b-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.4000 / 1M tokens
Output: $0.4000 / 1M tokens
Last synced Apr 25, 2026
Meta: Llama 3.1 70B Instruct is an instruction-tuned model aimed at complex instruction-following tasks such as customer support automation and content generation. Its 131,072-token context window accommodates long dialogues and large documents, so context can be carried across extended interactions. Input and output are both priced at $0.40 per million tokens, so cost scales linearly with total token volume regardless of the prompt-to-completion ratio.
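The "Full-context input ≈ $0.05" figure can be checked with a few lines of arithmetic. The sketch below is illustrative only: the constants are copied from this page, while the function name `input_cost_usd` is a hypothetical helper and not part of any provider SDK.

```python
# Figures copied from this page; the helper name is an assumption for illustration.
CONTEXT_WINDOW_TOKENS = 131_072
INPUT_PRICE_PER_MILLION_USD = 0.40


def input_cost_usd(prompt_tokens: int) -> float:
    """Return the input cost in USD for a prompt of the given size."""
    if not 0 <= prompt_tokens <= CONTEXT_WINDOW_TOKENS:
        raise ValueError("prompt_tokens must fit within the context window")
    return prompt_tokens * INPUT_PRICE_PER_MILLION_USD / 1_000_000


# Cost of a prompt that fills the whole context window.
full_context_cost = input_cost_usd(CONTEXT_WINDOW_TOKENS)
print(f"Full-context input: ${full_context_cost:.4f}")  # prints $0.0524
```

Rounded to two decimals this gives the $0.05 shown on the page. Completion tokens are billed separately at the output rate.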
Max Output
16,384
Completion tokens
Input Price / 1M
$0.4000
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.4000
Current Output / 1M
$0.4000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.4000 |
| Output (Completion) | $0.4000 |
Estimate monthly spend for Meta: Llama 3.1 70B Instruct based on your workload.
Estimated Monthly Cost
≈ $15 ($14.80 before rounding)
25M input + 12M output tokens
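The monthly estimate can be reproduced from the per-million prices in the table. This is a minimal sketch assuming flat per-token billing with no volume discounts, caching, or minimum fees. The function name `monthly_cost_usd` is hypothetical.

```python
# Prices copied from the table on this page (USD per 1M tokens).
INPUT_PRICE_PER_MILLION_USD = 0.40
OUTPUT_PRICE_PER_MILLION_USD = 0.40


def monthly_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate monthly spend in USD, assuming flat per-token pricing."""
    input_cost = input_tokens * INPUT_PRICE_PER_MILLION_USD
    output_cost = output_tokens * OUTPUT_PRICE_PER_MILLION_USD
    return (input_cost + output_cost) / 1_000_000


# The example workload from this page: 25M input + 12M output tokens.
estimate = monthly_cost_usd(25_000_000, 12_000_000)
print(f"Estimated monthly cost: ${estimate:.2f}")  # prints $14.80
```

The result, $14.80, matches the displayed $15 once rounded to the nearest dollar. Because the input and output rates are equal, only the total of 37M tokens matters here. With differing rates the split between prompt and completion tokens would change the result.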