Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8
Canonical ID: qwen/qwen3-next-80b-a3b-instruct-2509
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.0900
Output: $1.10
Last synced Apr 23, 2026 · MMLU score via public benchmark data
Qwen3 Next 80B A3B Instruct by Qwen is designed for complex tasks requiring extensive context, accommodating a remarkable 262,144 tokens, making it ideal for applications in natural language understanding and large-scale document processing. With an input price of $0.09 per 1M tokens and an output price of $1.10 per 1M tokens, teams can effectively budget for high-volume usage while leveraging its advanced capabilities for customer support automation and content generation. This model's pricing structure allows organizations to optimize costs based on their specific use cases, ensuring a balance between performance and expenditure.
Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Max Output
—
Not specified
Input Price / 1M
$0.0900
Prompt tokens
Output Price / 1M
$1.10
Completion tokens
Top Benchmark
81.9
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 Next 80B A3B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0900
Current Output / 1M
$1.10
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0900 |
| Output (Completion) | $1.10 |
Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Instruct based on your workload.
Estimated Monthly Cost
$15
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.