Context Window
131,072
Input tokens
Full-context input ≈ $0.01
Model Cost Profile
Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown
Canonical ID: qwen/qwen3-next-80b-a3b-thinking-2509
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.0975
Output: $0.7800
Last synced Apr 25, 2026 · MMLU score via public benchmark data
Qwen3 Next 80B A3B Thinking, developed by qwen, offers a substantial context window of 128,000 tokens, making it ideal for complex applications such as long-form content generation and detailed data analysis. With an input price of $0.15 per 1 million tokens and an output price of $1.20 per 1 million tokens, teams can effectively manage their budget while leveraging the model for extensive projects. This pricing structure allows organizations to scale their usage according to specific needs, optimizing costs for high-volume tasks.
Context Window
131,072
Input tokens
Full-context input ≈ $0.01
Max Output
32,768
Completion tokens
Input Price / 1M
$0.0975
Prompt tokens
Output Price / 1M
$0.7800
Completion tokens
Top Benchmark
82.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 Next 80B A3B Thinking. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0975
Current Output / 1M
$0.7800
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0975 |
| Output (Completion) | $0.7800 |
Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Thinking based on your workload.
Estimated Monthly Cost
$12
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.