Context Window
327,680
Input tokens
Full-context input โ $0.03
Model Cost Profile
Developer: meta-llamaยท Tokenizer: Llama4 ยท Quantization: fp8
Canonical ID: meta-llama/llama-4-scout-17b-16e-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.0800
Output: $0.3000
Last synced Apr 25, 2026
Meta: Llama 4 Scout, developed by meta-llama, offers an extensive context window of 327,680 tokens, making it suitable for complex tasks such as long-form content generation and detailed data analysis. Teams leveraging this API model will find that the input pricing is set at $0.08 per million tokens, while output costs are $0.30 per million tokens, allowing for scalable budgeting based on usage. This model is ideal for applications requiring in-depth contextual understanding, such as conversational agents and advanced research tools.
Context Window
327,680
Input tokens
Full-context input โ $0.03
Max Output
16,384
Completion tokens
Input Price / 1M
$0.0800
Prompt tokens
Output Price / 1M
$0.3000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0800
Current Output / 1M
$0.3000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0800 |
| Output (Completion) | $0.3000 |
Estimate monthly spend for Meta: Llama 4 Scout based on your workload.
Estimated Monthly Cost
$5.60
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.