Context Window
128,000
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.2000
Output: $0.6000
Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data
Qwen: Qwen2.5 VL 32B Instruct is designed for applications requiring extensive context management, accommodating up to 128,000 tokens, making it suitable for complex tasks like document summarization and conversational AI. With an input price of $0.20 per million tokens and an output price of $0.60 per million tokens, teams can effectively budget for high-volume projects while optimizing their API usage. This model's capabilities are ideal for businesses looking to enhance customer interactions or automate content generation at scale.
Context Window
128,000
Tokens
Input Price / 1M
$0.2000
Prompt tokens
Output Price / 1M
$0.6000
Completion tokens
Intelligence (MMLU)
63.5
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: Qwen2.5 VL 32B Instruct.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2000 |
| Output (Completion) | $0.6000 |
Price History
Current Input / 1M
$0.2000
Current Output / 1M
$0.6000
Estimate monthly spend for Qwen: Qwen2.5 VL 32B Instruct based on your workload.
Estimated Monthly Cost
$12
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Qwen: Qwen2.5 VL 32B Instruct.
Qwen: Qwen2.5 VL 32B Instruct input pricing is $0.2000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 VL 32B Instruct output pricing is $0.6000 per 1M tokens based on the latest synced provider data.
Qwen: Qwen2.5 VL 32B Instruct supports a context window of 128,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.