Model Cost Profile

Qwen: QwQ 32B

Developer: qwen· Tokenizer: Qwen · Instruct: qwq · Quantization: fp8

Canonical ID: qwen/qwq-32b

Pricing updated Apr 24, 2026

Input rank: #126Output rank: #130

Live Pricing

Input: $0.1500

Output: $0.5800

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen: QwQ 32B, developed by qwen, offers a substantial context window of 32,768 tokens, making it suitable for applications requiring extensive text comprehension, such as document summarization and conversational AI. With an input price of $0.15 per million tokens and an output price of $0.40 per million tokens, teams can effectively manage their budgets while leveraging this model for high-volume tasks. This pricing structure allows organizations to optimize costs for both training and deployment, particularly in projects involving large datasets or real-time interactions.

🔧 Tool Calling🔌 MCP Compatible🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.02

Max Output

131,072

Completion tokens

Input Price / 1M

$0.1500

Prompt tokens

Output Price / 1M

$0.5800

Completion tokens

Top Benchmark

76.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: QwQ 32B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	59.3	#68 of 125	artificial_analysis
MMLU	76.4	#61 of 127	artificial_analysis

Price History

Qwen: QwQ 32B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens+45.0%

Current Input / 1M

$0.1500

Current Output / 1M

$0.5800

Performance History

Qwen: QwQ 32B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1500
Output (Completion)	$0.5800

Compare with Qwen: Qwen3 Coder Next Compare with AllenAI: Olmo 3 32B Think Compare with Arcee AI: Trinity Large Preview

Cost Calculator

Estimate monthly spend for Qwen: QwQ 32B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$11

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$11 Free Models Router$0.00−$11 Google: Gemma 3 12B (free)$0.00−$11 Google: Gemma 3 27B (free)$0.00−$11

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: QwQ 32B vs Baidu: Qianfan-OCR-Fast (free)Qwen: QwQ 32B vs Free Models Router Qwen: QwQ 32B vs Google: Gemma 3 12B (free)Qwen: QwQ 32B vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

59.3

#68 of 125

artificial_analysis

MMLU

76.4

#61 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1500

Output (Completion)

$0.5800

Cost Calculator

Estimate monthly spend for Qwen: QwQ 32B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$11

25M input + 12M output tokens