Context Window
163,840
Input tokens
Full-context input ≈ $0.03
Model Cost Profile
Developer: meta-llama · Tokenizer: Other · Quantization: bf16
Canonical ID: meta-llama/llama-guard-4-12b
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.1800
Output: $0.1800
Last synced Apr 24, 2026
Meta: Llama Guard 4 12B, developed by meta-llama, is a safety classifier that labels prompts and model responses as safe or unsafe rather than a general-purpose generation model. Its 163,840-token context window lets it screen long documents or multi-turn conversations in a single request. Input and output are both priced at $0.18 per million tokens, so a full-context input costs about $0.03, which keeps high-volume moderation workloads inexpensive.
Max Output
Not specified
Input Price / 1M
$0.1800
Prompt tokens
Output Price / 1M
$0.1800
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.1800
Current Output / 1M
$0.1800
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1800 |
| Output (Completion) | $0.1800 |
Estimate monthly spend for Meta: Llama Guard 4 12B based on your workload.
Estimated Monthly Cost
$6.66
25M input + 12M output tokens
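The estimate above is plain per-token arithmetic, and a minimal sketch can reproduce it. The function name `estimate_cost` and the constants are hypothetical, chosen for illustration; the prices are the ones listed on this page and should be re-checked against live pricing before use.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, as listed on this page (assumed current).
INPUT_PRICE_PER_M = Decimal("0.18")
OUTPUT_PRICE_PER_M = Decimal("0.18")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of a workload at the listed per-token prices."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Monthly workload from the estimate: 25M input + 12M output tokens.
monthly = estimate_cost(25_000_000, 12_000_000)

# A single request that fills the whole 163,840-token context window.
full_context = estimate_cost(163_840, 0)

print(f"Monthly estimate: ${monthly:.2f}")
print(f"Full-context input: ${full_context:.4f}")
```

Decimal arithmetic is used here so that small per-request amounts are not distorted by binary floating-point rounding when summed over many requests.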