Context Window
131,072
Input tokens
Full-context input ≈ $0.03
Model Cost Profile
Developer: qwen · Tokenizer: Qwen3 · Quantization: unknown
Canonical ID: qwen/qwen3-vl-235b-a22b-thinking
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.2600 / 1M tokens
Output: $2.60 / 1M tokens
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Qwen3 VL 235B A22B Thinking, developed by Qwen, offers a 131,072-token context window and up to 32,768 completion tokens, which suits long-context tasks such as long-form content generation and detailed data analysis. It is priced at $0.26 per million input tokens and $2.60 per million output tokens, so output costs ten times as much as input and a full-context prompt costs roughly $0.03. Teams that need to process large volumes of information should therefore budget mainly around completion tokens when planning experimentation and deployment.
Max Output
32,768
Completion tokens
Input Price / 1M
$0.2600
Prompt tokens
Output Price / 1M
$2.60
Completion tokens
Top Benchmark
83.6
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 VL 235B A22B Thinking. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.2600
Current Output / 1M
$2.60
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2600 |
| Output (Completion) | $2.60 |
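The per-token rates in the table translate into a per-request cost by simple proportion. The following is a minimal sketch, not the page's own calculator: the function name `request_cost` and the constant names are illustrative assumptions, while the rates and token limits are the ones listed on this page.

```python
# Rates and limits copied from this page; names are illustrative assumptions.
INPUT_PRICE_PER_M = 0.26    # USD per 1M prompt tokens
OUTPUT_PRICE_PER_M = 2.60   # USD per 1M completion tokens
CONTEXT_WINDOW = 131_072    # tokens
MAX_OUTPUT = 32_768         # completion tokens


def request_cost(prompt_tokens: int, completion_tokens: int = 0) -> float:
    """Return the USD cost of one request at the listed per-million rates."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        prompt_tokens * INPUT_PRICE_PER_M
        + completion_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


if __name__ == "__main__":
    # A prompt that fills the whole context window, with no output counted.
    print(f"full-context input: ${request_cost(CONTEXT_WINDOW):.4f}")
    # A completion at the maximum output length, with no input counted.
    print(f"max-length output:  ${request_cost(0, MAX_OUTPUT):.4f}")
```

A prompt of 131,072 tokens comes to about $0.034, which matches the "Full-context input ≈ $0.03" figure above.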
Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Thinking based on your workload.
Estimated Monthly Cost
$38
25M input + 12M output tokens
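The $38 estimate can be reproduced from the listed rates. This sketch assumes the estimate is a plain sum of input and output charges with no discounts or caching; the function name `monthly_cost` and its parameters are hypothetical.

```python
# Rates copied from this page; function and parameter names are hypothetical.
INPUT_PRICE_PER_M = 0.26   # USD per 1M prompt tokens
OUTPUT_PRICE_PER_M = 2.60  # USD per 1M completion tokens


def monthly_cost(input_millions: float, output_millions: float) -> dict:
    """Break a monthly workload (in millions of tokens) into USD charges."""
    input_usd = input_millions * INPUT_PRICE_PER_M
    output_usd = output_millions * OUTPUT_PRICE_PER_M
    total_usd = input_usd + output_usd
    return {
        "input_usd": input_usd,
        "output_usd": output_usd,
        "total_usd": total_usd,
        # Share of the bill driven by completion tokens.
        "output_share": output_usd / total_usd if total_usd else 0.0,
    }


if __name__ == "__main__":
    estimate = monthly_cost(input_millions=25, output_millions=12)
    print(f"input:  ${estimate['input_usd']:.2f}")
    print(f"output: ${estimate['output_usd']:.2f}")
    print(f"total:  ${estimate['total_usd']:.2f}")
    print(f"output share of total: {estimate['output_share']:.0%}")
```

For 25M input and 12M output tokens this gives $6.50 + $31.20 = $37.70, which rounds to the $38 shown. Output tokens account for most of the total even though they are less than half of the input volume.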