Context Window
131,072
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: qwen · Tokenizer: Qwen · Instruct: qwq · Quantization: fp8
Canonical ID: qwen/qwq-32b
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.1500
Output: $0.5800
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Qwen: QwQ 32B, developed by qwen, offers a context window of 131,072 tokens, making it suitable for applications that require long-text comprehension, such as document summarization and conversational AI. At $0.15 per million input tokens and $0.58 per million output tokens, it lets teams budget predictably for high-volume tasks. Because output tokens cost nearly four times as much as input tokens, spend depends mainly on how much text the model generates, which matters most in projects involving large datasets or real-time interactions.
Max Output
131,072
Completion tokens
Input Price / 1M
$0.1500
Prompt tokens
Output Price / 1M
$0.5800
Completion tokens
Top Benchmark
76.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: QwQ 32B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.1500
Current Output / 1M
$0.5800
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1500 |
| Output (Completion) | $0.5800 |
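The per-million rates in the table can be turned into a per-request figure as sketched below. This is a minimal illustration, not an official billing formula: `request_cost` and the constant names are hypothetical, and the sketch assumes billing is strictly linear in token count, with no caching discounts, minimum charges, or provider-specific surcharges.

```python
from decimal import Decimal

# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.15")
OUTPUT_PRICE_PER_M = Decimal("0.58")
CONTEXT_WINDOW = 131_072  # listed context window, in tokens


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (assumes linear billing)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    million = Decimal(1_000_000)
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / million


# A prompt that fills the whole context window, with no completion tokens.
full_context = request_cost(CONTEXT_WINDOW, 0)
print(f"Full-context input: ${full_context:.4f}")
```

Under these assumptions a full 131,072-token prompt costs about $0.0197, which rounds to the "Full-context input ≈ $0.02" figure shown on this page. `Decimal` is used here only to avoid floating-point noise in small currency amounts.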
Estimate monthly spend for Qwen: QwQ 32B based on your workload.
Estimated Monthly Cost
$11
25M input + 12M output tokens
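The monthly estimate above can be reproduced by hand. The sketch below is a rough illustration under the assumption that spend is simply token volume times the listed rate. The function `monthly_cost` is a hypothetical helper, not part of any pricing API.

```python
# Rates as listed on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.58


def monthly_cost(input_millions: float, output_millions: float) -> dict:
    """Estimate monthly USD spend from token volumes given in millions.

    Assumes linear pricing with no discounts or minimum charges.
    """
    input_cost = input_millions * INPUT_PRICE_PER_M
    output_cost = output_millions * OUTPUT_PRICE_PER_M
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# Workload from the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(25, 12)
for part, usd in estimate.items():
    print(f"{part:>6}: ${usd:.2f}")
```

For this workload the input side comes to $3.75 and the output side to $6.96, for a total of $10.71, which matches the displayed $11 after rounding to the nearest dollar. Note that the 12M output tokens account for roughly two thirds of the total despite being the smaller volume.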