Context Window
65,536
Input tokens
Full-context input ≈ $0.01
Model Cost Profile
Developer: allenai· Tokenizer: Other · Quantization: bf16
Canonical ID: allenai/olmo-3-32b-think-20251121
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.1500
Output: $0.5000
Last synced Apr 24, 2026 · MMLU score via public benchmark data
AllenAI's Olmo 3 32B Think model offers a substantial context window of 65,536 tokens, making it ideal for applications requiring extensive text analysis, such as legal document review and long-form content generation. With an input price of $0.15 per million tokens and an output price of $0.50 per million tokens, teams can effectively manage costs while leveraging the model for complex tasks like summarization and conversational AI. This pricing structure allows organizations to scale their usage based on project needs, optimizing budget allocation for AI-driven solutions.
Context Window
65,536
Input tokens
Full-context input ≈ $0.01
Max Output
65,536
Completion tokens
Input Price / 1M
$0.1500
Prompt tokens
Output Price / 1M
$0.5000
Completion tokens
Top Benchmark
76.3
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for AllenAI: Olmo 3 32B Think. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.1500
Current Output / 1M
$0.5000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1500 |
| Output (Completion) | $0.5000 |
Estimate monthly spend for AllenAI: Olmo 3 32B Think based on your workload.
Estimated Monthly Cost
$9.75
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.