Context Window
131,072
Input tokens
Full-context input ≈ $0.06
Model Cost Profile
Developer: meta-llama· Tokenizer: Llama3 · Instruct: none · Quantization: unknown
Canonical ID: meta-llama/llama-guard-3-8b
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.4800
Output: $0.0300
Last synced Apr 24, 2026
Llama Guard 3 8B, developed by meta-llama, features an extensive context window of 131072 tokens, making it ideal for applications requiring in-depth analysis and long-form content generation. With an input cost of $0.02 per 1 million tokens and an output cost of $0.06 per 1 million tokens, teams can effectively manage their budget while leveraging the model for tasks such as customer support automation and complex data summarization. This model's scalability and pricing structure are particularly beneficial for organizations that need to process large volumes of text efficiently.
Context Window
131,072
Input tokens
Full-context input ≈ $0.06
Max Output
—
Not specified
Input Price / 1M
$0.4800
Prompt tokens
Output Price / 1M
$0.0300
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.4800
Current Output / 1M
$0.0300
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.4800 |
| Output (Completion) | $0.0300 |
Estimate monthly spend for Llama Guard 3 8B based on your workload.
Estimated Monthly Cost
$12
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.