Model Cost Profile

Qwen: Qwen2.5 VL 32B Instruct

Developer: qwen

Pricing updated Mar 11, 2026

Input rank: #139Output rank: #139

Live Pricing

Input: $0.2000

Output: $0.6000

Pricing via OpenRouter API ยท Last synced Mar 11, 2026 ยท MMLU score via public benchmark data

Qwen: Qwen2.5 VL 32B Instruct is designed for applications requiring extensive context management, accommodating up to 128,000 tokens, making it suitable for complex tasks like document summarization and conversational AI. With an input price of $0.20 per million tokens and an output price of $0.60 per million tokens, teams can effectively budget for high-volume projects while optimizing their API usage. This model's capabilities are ideal for businesses looking to enhance customer interactions or automate content generation at scale.

๐Ÿ‘ Vision๐Ÿ“‹ Structured Output

Context Window

128,000

Tokens

Input Price / 1M

$0.2000

Prompt tokens

Output Price / 1M

$0.6000

Completion tokens

Intelligence (MMLU)

63.5

Massive Multitask Language Understanding

Benchmark Scores

Standardized evaluation scores for Qwen: Qwen2.5 VL 32B Instruct.

BenchmarkScoreRankSource
GPQA41.7#102 of 118artificial_analysis
MMLU63.5#105 of 121artificial_analysis

Price History

Qwen: Qwen2.5 VL 32B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%
Mar 7 โ€” Mar 11
$0.2000$0.4000$0.6000Mar 7Mar 8Mar 9Mar 10Mar 11

Current Input / 1M

$0.2000

Current Output / 1M

$0.6000

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for Qwen: Qwen2.5 VL 32B Instruct.

How much does Qwen: Qwen2.5 VL 32B Instruct cost per 1M input tokens?

Qwen: Qwen2.5 VL 32B Instruct input pricing is $0.2000 per 1M tokens based on the latest synced provider data.

How much does Qwen: Qwen2.5 VL 32B Instruct cost per 1M output tokens?

Qwen: Qwen2.5 VL 32B Instruct output pricing is $0.6000 per 1M tokens based on the latest synced provider data.

What context window does Qwen: Qwen2.5 VL 32B Instruct support?

Qwen: Qwen2.5 VL 32B Instruct supports a context window of 128,000 tokens.

How can I compare Qwen: Qwen2.5 VL 32B Instruct with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.