Context Window
131,072
Tokens
Model Cost Profile
Developer: qwen
Pricing updated Mar 11, 2026
Live Pricing
Input: $0.1040 / 1M tokens
Output: $0.4160 / 1M tokens
Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data
Qwen3 VL 32B Instruct by qwen has a context window of 131,072 tokens, which suits long-input tasks such as document summarization and multi-turn conversational AI. Input is priced at $0.1040 per 1M tokens and output at $0.4160 per 1M tokens, so spend scales linearly with token usage. The model is positioned for teams that need long-context handling at a low per-token cost.
Input Price / 1M
$0.1040
Prompt tokens
Output Price / 1M
$0.4160
Completion tokens
Intelligence (MMLU)
79.1
Massive Multitask Language Understanding
Standardized evaluation scores for Qwen: Qwen3 VL 32B Instruct.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1040 |
| Output (Completion) | $0.4160 |
Estimate monthly spend for Qwen: Qwen3 VL 32B Instruct based on your workload.
Estimated Monthly Cost
$7.59
25M input + 12M output tokens
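The estimate above follows directly from the listed per-token prices. A minimal sketch of the arithmetic, assuming flat per-token pricing with no volume discounts, caching, or image-input surcharges (the function name `estimate_monthly_cost` is illustrative, not part of any provider API):

```python
from decimal import Decimal

# Prices per 1M tokens, as listed on this page (synced Mar 11, 2026).
INPUT_PRICE_PER_M = Decimal("0.1040")
OUTPUT_PRICE_PER_M = Decimal("0.4160")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated monthly spend in USD, unrounded.

    Hypothetical helper: assumes cost is linear in token count.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Workload from the estimate above: 25M input + 12M output tokens.
cost = estimate_monthly_cost(25_000_000, 12_000_000)
print(f"${cost:.2f}")  # 2.600 + 4.992 = 7.592, displayed as $7.59
```

Output tokens cost four times as much as input tokens here, so in this workload the 12M output tokens account for roughly two thirds of the total.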
Common pricing and benchmark questions for Qwen: Qwen3 VL 32B Instruct.
Qwen: Qwen3 VL 32B Instruct input pricing is $0.1040 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 32B Instruct output pricing is $0.4160 per 1M tokens based on the latest synced provider data.
Qwen: Qwen3 VL 32B Instruct supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
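When sizing requests against the 131,072-token context window, a quick budget check can help. The sketch below assumes that prompt and completion tokens share the same window, which is common but not confirmed by this page; a provider may also cap output length separately. The helper names are hypothetical.

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page


def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Check whether a request fits, assuming prompt and output share the window."""
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW


def max_output_budget(prompt_tokens: int) -> int:
    """Tokens left for the completion after the prompt (never negative)."""
    return max(0, CONTEXT_WINDOW - prompt_tokens)


# Example: a 100,000-token document leaves 31,072 tokens for the response.
print(fits_in_context(100_000, 4_096))
print(max_output_budget(100_000))
```

Token counts depend on the model's tokenizer, so the prompt length should be measured with the provider's own tokenizer or usage report rather than estimated from character counts.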