Model Cost Profile

Qwen: Qwen3 VL 235B A22B Thinking

Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-vl-235b-a22b-thinking

Pricing updated Apr 24, 2026

Input rank: #163Output rank: #243

Live Pricing

Input: $0.2600

Output: $2.60

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 VL 235B A22B Thinking, developed by qwen, offers a substantial context window of 131,072 tokens, making it suitable for complex tasks such as long-form content generation and detailed data analysis. With a pricing structure of $0.00 for both input and output per million tokens, teams can leverage this model for extensive projects without incurring costs, allowing for scalable experimentation and deployment. Its capabilities are particularly beneficial for organizations needing to process large volumes of information while maintaining high-quality outputs in real-time applications.

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.03

Max Output

32,768

Completion tokens

Input Price / 1M

$0.2600

Prompt tokens

Output Price / 1M

$2.60

Completion tokens

Top Benchmark

83.6

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 VL 235B A22B Thinking. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	77.2	#27 of 125	artificial_analysis
MMLU	83.6	#21 of 127	artificial_analysis

Price History

Qwen: Qwen3 VL 235B A22B Thinking Pricing Trend

Input / 1M tokensOutput / 1M tokens

Current Input / 1M

$0.2600

Current Output / 1M

$2.60

Performance History

Qwen: Qwen3 VL 235B A22B Thinking Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2600
Output (Completion)	$2.60

Compare with Qwen: Qwen Plus 0728 Compare with Qwen: Qwen Plus 0728 (thinking)Compare with Qwen: Qwen-Plus

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$38 Free Models Router$0.00−$38 Google: Gemma 3 12B (free)$0.00−$38 Google: Gemma 3 27B (free)$0.00−$38

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 VL 235B A22B Thinking vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 VL 235B A22B Thinking vs Free Models Router Qwen: Qwen3 VL 235B A22B Thinking vs Google: Gemma 3 12B (free)Qwen: Qwen3 VL 235B A22B Thinking vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

77.2

#27 of 125

artificial_analysis

MMLU

83.6

#21 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.2600

Output (Completion)

$2.60

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens