Model Cost Profile

Qwen: Qwen2.5 VL 32B Instruct

Developer: qwen

Pricing updated Mar 11, 2026

Input rank: #139Output rank: #139

Live Pricing

Input: $0.2000

Output: $0.6000

Get API Key View full pricing leaderboard

Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data

Qwen: Qwen2.5 VL 32B Instruct is designed for applications requiring extensive context management, accommodating up to 128,000 tokens, making it suitable for complex tasks like document summarization and conversational AI. With an input price of $0.20 per million tokens and an output price of $0.60 per million tokens, teams can effectively budget for high-volume projects while optimizing their API usage. This model's capabilities are ideal for businesses looking to enhance customer interactions or automate content generation at scale.

👁 Vision📋 Structured Output

Context Window

128,000

Tokens

Input Price / 1M

$0.2000

Prompt tokens

Output Price / 1M

$0.6000

Completion tokens

Intelligence (MMLU)

63.5

Massive Multitask Language Understanding

Benchmark Scores

Standardized evaluation scores for Qwen: Qwen2.5 VL 32B Instruct.

Benchmark	Score	Rank	Source
GPQA	41.7	#102 of 118	artificial_analysis
MMLU	63.5	#105 of 121	artificial_analysis

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2000
Output (Completion)	$0.6000

Compare with Qwen: Qwen3 VL 235B A22B Instruct Compare with AllenAI: Molmo2 8B Compare with AllenAI: Olmo 3.1 32B Instruct

Price History

Qwen: Qwen2.5 VL 32B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Mar 7 — Mar 11

Current Input / 1M

$0.2000

Current Output / 1M

$0.6000

Cost Calculator

Estimate monthly spend for Qwen: Qwen2.5 VL 32B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$12

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$12 Arcee AI: Trinity Mini (free)$0.00−$12 Google: Gemma 3 12B (free)$0.00−$12 Google: Gemma 3 27B (free)$0.00−$12

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen2.5 VL 32B Instruct vs Arcee AI: Trinity Large Preview (free)Qwen: Qwen2.5 VL 32B Instruct vs Arcee AI: Trinity Mini (free)Qwen: Qwen2.5 VL 32B Instruct vs Google: Gemma 3 12B (free)Qwen: Qwen2.5 VL 32B Instruct vs Google: Gemma 3 27B (free)

FAQ

Common pricing and benchmark questions for Qwen: Qwen2.5 VL 32B Instruct.

How much does Qwen: Qwen2.5 VL 32B Instruct cost per 1M input tokens?

Qwen: Qwen2.5 VL 32B Instruct input pricing is $0.2000 per 1M tokens based on the latest synced provider data.

How much does Qwen: Qwen2.5 VL 32B Instruct cost per 1M output tokens?

Qwen: Qwen2.5 VL 32B Instruct output pricing is $0.6000 per 1M tokens based on the latest synced provider data.

What context window does Qwen: Qwen2.5 VL 32B Instruct support?

Qwen: Qwen2.5 VL 32B Instruct supports a context window of 128,000 tokens.

How can I compare Qwen: Qwen2.5 VL 32B Instruct with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.