Context Window
8,192
Input tokens
Full-context input ≈ $0.0042
Model Cost Profile
Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: meta-llama/llama-3-70b-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.5100 / 1M tokens
Output: $0.7400 / 1M tokens
Last synced Apr 25, 2026
Meta: Llama 3 70B Instruct is an instruction-tuned model suited to natural language tasks such as customer support automation and content generation. Its 8,192-token context window holds a multi-turn conversation or a few pages of reference text in a single request; longer inputs have to be truncated or summarized. Pricing is $0.51 per 1M input tokens and $0.74 per 1M output tokens, so the cost of heavy usage scales directly with token volume.
Max Output
8,000
Completion tokens
Input Price / 1M
$0.5100
Prompt tokens
Output Price / 1M
$0.7400
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.5100 |
| Output (Completion) | $0.7400 |
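The per-request arithmetic behind the table can be sketched as follows. This is a minimal illustration, assuming billing is strictly linear in token count at the listed rates; the function name `request_cost` and the constants are hypothetical and not part of any provider API.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
INPUT_PRICE_PER_M = 0.51
OUTPUT_PRICE_PER_M = 0.74
CONTEXT_WINDOW = 8_192  # tokens, from the profile above


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request (hypothetical helper).

    Assumes cost is linear in tokens, with no minimum charge or rounding.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


# A prompt that fills the whole context window, with no completion.
full_context = request_cost(CONTEXT_WINDOW, 0)
print(f"${full_context:.4f}")  # prints $0.0042
```

At two decimal places the same figure rounds to $0.00, so per-request costs at this scale are easier to read with four decimals.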
Estimate monthly spend for Meta: Llama 3 70B Instruct based on your workload.
Estimated Monthly Cost
$22
25M input + 12M output tokens
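The estimate above can be reproduced with a short calculation. This is a sketch under the assumption that monthly spend is simply token volume times the listed per-1M prices; `monthly_cost` is a hypothetical helper name, and the default prices are the ones listed on this page.

```python
def monthly_cost(input_millions: float, output_millions: float,
                 input_price: float = 0.51, output_price: float = 0.74) -> float:
    """Return estimated monthly spend in USD (hypothetical helper).

    Volumes are in millions of tokens; prices are USD per 1M tokens.
    """
    return input_millions * input_price + output_millions * output_price


# Workload from the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(25, 12)
print(f"${estimate:.2f}")  # prints $21.63
```

The input side contributes $12.75 and the output side $8.88, which gives $21.63 and rounds to the $22 shown.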