Context Window
32,768
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.1500 / 1M tokens
Output: $0.4000 / 1M tokens
Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data
Qwen: QwQ 32B, developed by qwen, has a context window of 32,768 tokens, which suits tasks such as document summarization and conversational AI. Input is priced at $0.15 per million tokens and output at $0.40 per million tokens, so output tokens cost roughly 2.7 times as much as input tokens. These are per-token usage prices, so spend scales directly with prompt and completion volume, which matters most for high-volume or real-time workloads.
Input Price / 1M
$0.1500
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Intelligence (MMLU)
76.4
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: QwQ 32B.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1500 |
| Output (Completion) | $0.4000 |
Price History
Current Input / 1M
$0.1500
Current Output / 1M
$0.4000
Estimate monthly spend for Qwen: QwQ 32B based on your workload.
Estimated Monthly Cost
$8.55
25M input + 12M output tokens
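The estimate above follows directly from the listed per-million prices: 25 × $0.15 + 12 × $0.40 = $3.75 + $4.80 = $8.55. A minimal sketch of that arithmetic, assuming flat per-token pricing with no caching discounts or minimum fees; the function name `estimate_monthly_cost` is hypothetical and not part of any provider API:

```python
from decimal import Decimal

# Prices in USD per 1M tokens, as listed on this page (synced Mar 11, 2026).
INPUT_PRICE_PER_M = Decimal("0.15")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated monthly spend in USD, rounded to cents.

    Assumes flat per-token pricing (no discounts, tiers, or minimum charges).
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost).quantize(Decimal("0.01"))


# The workload shown above: 25M input tokens and 12M output tokens per month.
cost = estimate_monthly_cost(25_000_000, 12_000_000)
print(f"${cost}")  # $8.55
```

`Decimal` is used instead of floats so the cent-level result is not affected by binary rounding. Substitute your own monthly token counts to project spend for a different workload.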
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Qwen: QwQ 32B.
Qwen: QwQ 32B input pricing is $0.1500 per 1M tokens based on the latest synced provider data.
Qwen: QwQ 32B output pricing is $0.4000 per 1M tokens based on the latest synced provider data.
Qwen: QwQ 32B supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
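When sizing requests against the 32,768-token context window, it can help to check the token budget before sending a request. The sketch below assumes that prompt and completion tokens share the same window, which is common but not confirmed by this page, and that token counts come from your own tokenizer. The helper names are hypothetical:

```python
# Context window in tokens, as listed on this page.
CONTEXT_WINDOW = 32_768


def max_output_budget(prompt_tokens: int) -> int:
    """Hypothetical helper: tokens left for the completion.

    Assumes prompt and completion share one context window.
    """
    return max(0, CONTEXT_WINDOW - prompt_tokens)


def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Hypothetical helper: True if the request stays within the window."""
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW


# Example: a 30,000-token prompt leaves 2,768 tokens for the completion.
print(max_output_budget(30_000))       # 2768
print(fits_in_context(30_000, 4_000))  # False
```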