Context Window
1,048,576
Input tokens
Full-context input โ $0.16
Model Cost Profile
Developer: meta-llamaยท Tokenizer: Llama4 ยท Quantization: fp8
Canonical ID: meta-llama/llama-4-maverick-17b-128e-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.1500
Output: $0.6000
Last synced Apr 25, 2026
Meta: Llama 4 Maverick offers a substantial context window of 1,048,576 tokens, making it suitable for applications requiring extensive data processing, such as document summarization and complex conversational agents. With an input price of $0.15 per 1 million tokens and an output price of $0.60 per 1 million tokens, teams can effectively budget for high-volume usage while optimizing their operational costs. This model is ideal for organizations that need to analyze large datasets or generate detailed content without sacrificing performance or incurring excessive expenses.
Context Window
1,048,576
Input tokens
Full-context input โ $0.16
Max Output
16,384
Completion tokens
Input Price / 1M
$0.1500
Prompt tokens
Output Price / 1M
$0.6000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.1500
Current Output / 1M
$0.6000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.9%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1500 |
| Output (Completion) | $0.6000 |
Estimate monthly spend for Meta: Llama 4 Maverick based on your workload.
Estimated Monthly Cost
$11
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.