Model Cost Profile
Developer: Qwen · Tokenizer: Qwen3 · Quantization: unknown
Canonical ID: qwen/qwen3-vl-30b-a3b-instruct
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.1300 / 1M tokens
Output: $0.5200 / 1M tokens
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Qwen3 VL 30B A3B Instruct, developed by Qwen, has a context window of 131,072 tokens, which suits workloads with long inputs such as legal document review or long-form content generation. Input costs $0.13 per million tokens and output costs $0.52 per million tokens, so each output token costs four times as much as an input token. At these rates, a prompt that fills the entire context window costs about $0.02 in input charges.
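The "Full-context input ≈ $0.02" figure on this page can be reproduced from the context window size and the input price. The sketch below is only an illustration: the constant and function names are made up for this example, and it assumes billing is strictly linear in token count with no minimum charge or caching discount.

```python
from decimal import Decimal

# Values taken from this page; the names are illustrative, not an official API.
CONTEXT_WINDOW_TOKENS = 131_072
INPUT_PRICE_PER_MILLION = Decimal("0.13")  # USD per 1M prompt tokens


def full_context_input_cost() -> Decimal:
    """USD input cost of one prompt that fills the whole context window."""
    return INPUT_PRICE_PER_MILLION * CONTEXT_WINDOW_TOKENS / Decimal(1_000_000)


cost = full_context_input_cost()
print(cost)                            # 0.01703936
print(cost.quantize(Decimal("0.01")))  # 0.02
```

The unrounded value is about 1.7 cents, which the page rounds to $0.02.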
Context Window
131,072
Input tokens
Full-context input ≈ $0.02
Max Output
32,768
Completion tokens
Input Price / 1M
$0.1300
Prompt tokens
Output Price / 1M
$0.5200
Completion tokens
Top Benchmark
72.5
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 VL 30B A3B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH, and HumanEval.
Price History
Current Input / 1M
$0.1300
Current Output / 1M
$0.5200
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1300 |
| Output (Completion) | $0.5200 |
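To turn the per-million rates in the table into a per-request figure, multiply each token count by its rate and divide by one million. The following is a minimal sketch; `request_cost` is a hypothetical helper, not part of any provider SDK. It checks each count against the limits listed on this page (131,072 context tokens, 32,768 completion tokens) but does not model whether prompt and completion must share the context window, since the page does not say.

```python
from decimal import Decimal

# USD per 1M tokens, from the pricing table; limits from the listing.
INPUT_PRICE = Decimal("0.13")
OUTPUT_PRICE = Decimal("0.52")
MAX_CONTEXT_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 32_768
MILLION = Decimal(1_000_000)


def request_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Hypothetical helper: USD cost of a single request at the listed rates."""
    if not 0 <= prompt_tokens <= MAX_CONTEXT_TOKENS:
        raise ValueError("prompt_tokens is outside the listed context window")
    if not 0 <= completion_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("completion_tokens is outside the listed max output")
    return (prompt_tokens * INPUT_PRICE + completion_tokens * OUTPUT_PRICE) / MILLION


# Example: a 10,000-token prompt with a 1,000-token reply.
example = request_cost(10_000, 1_000)
```

For that example the input side contributes $0.0013 and the output side $0.00052, so even a reply one tenth the size of the prompt accounts for over a quarter of the total.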
Estimate monthly spend for Qwen: Qwen3 VL 30B A3B Instruct based on your workload.
Estimated Monthly Cost
$9.49
25M input + 12M output tokens
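The $9.49 estimate is the sum of the input and output charges for that workload. A short sketch of the arithmetic follows; the function name and its parameters are illustrative assumptions, and the defaults are the rates listed on this page.

```python
from decimal import Decimal


def monthly_cost(
    input_millions: int,
    output_millions: int,
    input_price: Decimal = Decimal("0.13"),   # USD per 1M prompt tokens
    output_price: Decimal = Decimal("0.52"),  # USD per 1M completion tokens
) -> Decimal:
    """Illustrative estimate: token volumes are given in millions per month."""
    input_cost = Decimal(input_millions) * input_price
    output_cost = Decimal(output_millions) * output_price
    return input_cost + output_cost


# 25M input tokens -> $3.25, 12M output tokens -> $6.24
print(monthly_cost(25, 12))  # 9.49
```

Output tokens make up roughly a third of the volume here but about two thirds of the bill, so reducing completion length is the more effective cost lever for this workload.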