Context Window
131,072
Tokens
Model Cost Profile
Developer: meta-llama
Pricing updated Mar 10, 2026
Llama Guard 3 8B, developed by meta-llama, features an extensive context window of 131072 tokens, making it ideal for applications requiring in-depth analysis and long-form content generation. With an input cost of $0.02 per 1 million tokens and an output cost of $0.06 per 1 million tokens, teams can effectively manage their budget while leveraging the model for tasks such as customer support automation and complex data summarization. This model's scalability and pricing structure are particularly beneficial for organizations that need to process large volumes of text efficiently.
Context Window
131,072
Tokens
Input Price / 1M
$0.0200
Prompt tokens
Output Price / 1M
$0.0600
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0200 |
| Output (Completion) | $0.0600 |
Price History
Current Input / 1M
$0.0200
Current Output / 1M
$0.0600
Estimate monthly spend for Llama Guard 3 8B based on your workload.
Estimated Monthly Cost
$1.22
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Llama Guard 3 8B.
Llama Guard 3 8B input pricing is $0.0200 per 1M tokens based on the latest synced provider data.
Llama Guard 3 8B output pricing is $0.0600 per 1M tokens based on the latest synced provider data.
Llama Guard 3 8B supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.