Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8
Canonical ID: qwen/qwen3-30b-a3b-instruct-2507
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0900
Output: $0.3000
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Qwen3 30B A3B Instruct 2507, developed by Qwen, offers a substantial context window of 262,144 tokens, making it ideal for applications requiring extensive data processing such as document summarization and complex dialogue systems. With an input cost of $0.09 per million tokens and an output cost of $0.30 per million tokens, teams can effectively manage their budgets while leveraging the model's advanced capabilities for tasks like content generation and interactive AI solutions. This pricing structure allows organizations to scale their usage based on specific project needs, ensuring cost-effectiveness in high-demand scenarios.
Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Max Output
262,144
Completion tokens
Input Price / 1M
$0.0900
Prompt tokens
Output Price / 1M
$0.3000
Completion tokens
Top Benchmark
80.5
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 30B A3B Instruct 2507. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0900
Current Output / 1M
$0.3000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
91.4%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0900 |
| Output (Completion) | $0.3000 |
Estimate monthly spend for Qwen: Qwen3 30B A3B Instruct 2507 based on your workload.
Estimated Monthly Cost
$5.85
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.