Model Cost Profile

Qwen: Qwen3 VL 235B A22B Instruct

Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3-vl-235b-a22b-instruct

Pricing updated Apr 24, 2026

Input rank: #142Output rank: #162

Live Pricing

Input: $0.2000

Output: $0.8800

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 VL 235B A22B Instruct is designed for advanced natural language processing tasks, making it suitable for applications such as chatbots, content generation, and data analysis. With a context window of 262,144 tokens, teams can handle extensive inputs and maintain coherence over longer conversations or documents. The pricing structure, at $0.20 per million input tokens and $0.88 per million output tokens, allows teams to optimize costs based on their specific usage patterns while scaling their applications efficiently.

💡 Enable prompt caching to save 45% on repeated input tokens ($0.1100/M cached vs $0.2000/M standard).

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

262,144

Input tokens

Full-context input ≈ $0.05

Max Output

—

Not specified

Input Price / 1M

$0.2000

Prompt tokens

Output Price / 1M

$0.8800

Completion tokens

Top Benchmark

83.6

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 VL 235B A22B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	77.2	#28 of 125	artificial_analysis
MMLU	83.6	#20 of 127	artificial_analysis

Price History

Qwen: Qwen3 VL 235B A22B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.2000

Current Output / 1M

$0.8800

Performance History

Qwen: Qwen3 VL 235B A22B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

98.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2000
Output (Completion)	$0.8800
Cache Read	$0.1100

Compare with Qwen: Qwen3 Coder Flash Compare with AllenAI: Olmo 3.1 32B Instruct Compare with DeepSeek: DeepSeek V3 0324

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$16

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$16 Free Models Router$0.00−$16 Google: Gemma 3 12B (free)$0.00−$16 Google: Gemma 3 27B (free)$0.00−$16

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 VL 235B A22B Instruct vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 VL 235B A22B Instruct vs Free Models Router Qwen: Qwen3 VL 235B A22B Instruct vs Google: Gemma 3 12B (free)Qwen: Qwen3 VL 235B A22B Instruct vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

77.2

#28 of 125

artificial_analysis

MMLU

83.6

#20 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.2000

Output (Completion)

$0.8800

Cache Read

$0.1100

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$16

25M input + 12M output tokens