Context Window
262,144
Input tokens
Full-context input ≈ $0.05
Model Cost Profile
Developer: qwen · Tokenizer: Qwen3 · Quantization: fp8
Canonical ID: qwen/qwen3-vl-235b-a22b-instruct
Pricing updated Apr 24, 2026
Live Pricing (per 1M tokens)
Input: $0.2000
Output: $0.8800
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Qwen3 VL 235B A22B Instruct is a vision-language (VL) instruct model suited to applications such as chatbots, content generation, and data analysis. Its 262,144-token context window lets a single request carry long documents or extended conversations. Input is priced at $0.20 per million tokens and output at $0.88 per million tokens, so output costs 4.4 times as much as input and output-heavy workloads account for most of the spend.
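The per-request arithmetic behind these figures can be sketched as below. This is a minimal illustration, not an official calculator: `request_cost` is a hypothetical helper, and the prices and context limit are simply copied from this page's Apr 24, 2026 snapshot. It assumes billing is linear in token count with no minimums or rounding per request.

```python
from decimal import Decimal

# Figures copied from this page (pricing snapshot dated Apr 24, 2026).
INPUT_PRICE_PER_M = Decimal("0.20")   # USD per 1M prompt tokens
OUTPUT_PRICE_PER_M = Decimal("0.88")  # USD per 1M completion tokens
CONTEXT_WINDOW = 262_144              # maximum input tokens

MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing, which is an assumption of this sketch.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > CONTEXT_WINDOW:
        raise ValueError("input exceeds the 262,144-token context window")
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / MILLION


# Filling the whole context window with input and generating nothing.
full_context = request_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context:.4f}")
```

Filling the entire window comes to about $0.0524, which matches the "Full-context input ≈ $0.05" figure shown on the page.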
Max Output
Not specified
Input Price / 1M
$0.2000
Prompt tokens
Output Price / 1M
$0.8800
Completion tokens
Top Benchmark
83.6
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 VL 235B A22B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.2000
Current Output / 1M
$0.8800
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
98.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2000 |
| Output (Completion) | $0.8800 |
| Cache Read | $0.1100 |
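To see what the cache-read rate means for a prompt, here is a rough sketch. It assumes that cached prompt tokens are billed at the Cache Read rate in place of the standard input rate; the page lists the rate but does not spell out the billing rule, so treat that as an assumption. `prompt_cost` is a hypothetical helper and the token counts are made up for illustration.

```python
from decimal import Decimal

# Rates copied from the table above (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.20")
CACHE_READ_PRICE_PER_M = Decimal("0.11")

MILLION = Decimal(1_000_000)


def prompt_cost(prompt_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD prompt cost when part of the prompt is a cache hit.

    Assumption: cached tokens are charged at the cache-read rate
    instead of the input rate, not in addition to it.
    """
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    uncached_tokens = prompt_tokens - cached_tokens
    return (uncached_tokens * INPUT_PRICE_PER_M
            + cached_tokens * CACHE_READ_PRICE_PER_M) / MILLION


# Illustrative workload: a 100,000-token prompt, 80,000 of which hit the cache.
cold = prompt_cost(100_000)
warm = prompt_cost(100_000, cached_tokens=80_000)
saving = 1 - warm / cold
print(f"cold ${cold}, warm ${warm}, saving {saving:.0%}")
```

Under that assumption, the 80% cache hit lowers the prompt cost from $0.02 to $0.0128, a 36% reduction. Output tokens are unaffected by caching and are still billed at the full output rate.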
Estimate monthly spend for Qwen: Qwen3 VL 235B A22B Instruct based on your workload.
Estimated Monthly Cost
$16
25M input + 12M output tokens
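The $16 estimate can be reproduced and split by direction with a short sketch. `monthly_breakdown` is a hypothetical helper; it uses the listed input and output rates only and assumes no cache reads, discounts, or fees.

```python
from decimal import Decimal

# Rates copied from this page (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.20")
OUTPUT_PRICE_PER_M = Decimal("0.88")


def monthly_breakdown(input_millions, output_millions) -> dict:
    """Split an estimated monthly bill into input, output, and total (USD).

    Volumes are given in millions of tokens. Cache reads are ignored,
    which is a simplifying assumption of this sketch.
    """
    input_cost = Decimal(str(input_millions)) * INPUT_PRICE_PER_M
    output_cost = Decimal(str(output_millions)) * OUTPUT_PRICE_PER_M
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# The workload shown above: 25M input + 12M output tokens per month.
estimate = monthly_breakdown(25, 12)
output_share = estimate["output"] / estimate["total"]
print(estimate, f"output share {output_share:.0%}")
```

For this workload the split is $5.00 for input and $10.56 for output, or $15.56 in total, which the page rounds to $16. Output makes up roughly two thirds of the bill even though it is less than half the token volume, so trimming completion length is the larger cost lever here.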