Model Cost Profile

Qwen: Qwen3 Next 80B A3B Instruct

Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3-next-80b-a3b-instruct-2509

Pricing updated Apr 23, 2026

Input rank: #75Output rank: #176

Live Pricing

Input: $0.0900

Output: $1.10

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026 · MMLU score via public benchmark data

Qwen3 Next 80B A3B Instruct by Qwen is designed for complex tasks requiring extensive context, accommodating a remarkable 262,144 tokens, making it ideal for applications in natural language understanding and large-scale document processing. With an input price of $0.09 per 1M tokens and an output price of $1.10 per 1M tokens, teams can effectively budget for high-volume usage while leveraging its advanced capabilities for customer support automation and content generation. This model's pricing structure allows organizations to optimize costs based on their specific use cases, ensuring a balance between performance and expenditure.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.0900

Prompt tokens

Output Price / 1M

$1.10

Completion tokens

Top Benchmark

81.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 Next 80B A3B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	73.8	#42 of 125	artificial_analysis
MMLU	81.9	#36 of 127	artificial_analysis

Price History

Qwen: Qwen3 Next 80B A3B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0900

Current Output / 1M

$1.10

Performance History

Qwen: Qwen3 Next 80B A3B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0900
Output (Completion)	$1.10

Compare with Qwen: Qwen3 30B A3B Instruct 2507 Compare with NVIDIA: Nemotron 3 Super Compare with Tongyi DeepResearch 30B A3B

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$15

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$15 Free Models Router$0.00−$15 Google: Gemma 3 12B (free)$0.00−$15 Google: Gemma 3 27B (free)$0.00−$15

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 Next 80B A3B Instruct vs Arcee AI: Trinity Large Preview (free)Qwen: Qwen3 Next 80B A3B Instruct vs Free Models Router Qwen: Qwen3 Next 80B A3B Instruct vs Google: Gemma 3 12B (free)Qwen: Qwen3 Next 80B A3B Instruct vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

73.8

#42 of 125

artificial_analysis

MMLU

81.9

#36 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0900

Output (Completion)

$1.10

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$15

25M input + 12M output tokens