Model Cost Profile

AllenAI: Olmo 3 32B Think

Developer: allenai· Tokenizer: Other · Quantization: bf16

Canonical ID: allenai/olmo-3-32b-think-20251121

Pricing updated Apr 24, 2026

Input rank: #113Output rank: #117

Live Pricing

Input: $0.1500

Output: $0.5000

HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

AllenAI's Olmo 3 32B Think model offers a substantial context window of 65,536 tokens, making it ideal for applications requiring extensive text analysis, such as legal document review and long-form content generation. With an input price of $0.15 per million tokens and an output price of $0.50 per million tokens, teams can effectively manage costs while leveraging the model for complex tasks like summarization and conversational AI. This pricing structure allows organizations to scale their usage based on project needs, optimizing budget allocation for AI-driven solutions.

📋 Structured Output🧠 Reasoning

Context Window

65,536

Input tokens

Full-context input ≈ $0.01

Max Output

65,536

Completion tokens

Input Price / 1M

$0.1500

Prompt tokens

Output Price / 1M

$0.5000

Completion tokens

Top Benchmark

76.3

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for AllenAI: Olmo 3 32B Think. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	59.1	#69 of 125	artificial_analysis
MMLU	76.3	#62 of 127	artificial_analysis

Price History

AllenAI: Olmo 3 32B Think Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.1500

Current Output / 1M

$0.5000

Performance History

AllenAI: Olmo 3 32B Think Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1500
Output (Completion)	$0.5000

Compare with AllenAI: Olmo 3.1 32B Instruct Compare with Arcee AI: Trinity Large Preview Compare with Cohere: Command R (08-2024)

Cost Calculator

Estimate monthly spend for AllenAI: Olmo 3 32B Think based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$9.75

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$9.75 Free Models Router$0.00−$9.75 Google: Gemma 3 12B (free)$0.00−$9.75 Google: Gemma 3 27B (free)$0.00−$9.75

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

AllenAI: Olmo 3 32B Think vs Baidu: Qianfan-OCR-Fast (free)AllenAI: Olmo 3 32B Think vs Free Models Router AllenAI: Olmo 3 32B Think vs Google: Gemma 3 12B (free)AllenAI: Olmo 3 32B Think vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

59.1

#69 of 125

artificial_analysis

MMLU

76.3

#62 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1500

Output (Completion)

$0.5000

Cost Calculator

Estimate monthly spend for AllenAI: Olmo 3 32B Think based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$9.75

25M input + 12M output tokens