Model Cost Profile

Qwen: Qwen3 Next 80B A3B Thinking

Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-next-80b-a3b-thinking-2509

Pricing updated Apr 25, 2026

Input rank: #80Output rank: #154

Live Pricing

Input: $0.0975

Output: $0.7800

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026 · MMLU score via public benchmark data

Qwen3 Next 80B A3B Thinking, developed by qwen, offers a substantial context window of 128,000 tokens, making it ideal for complex applications such as long-form content generation and detailed data analysis. With an input price of $0.15 per 1 million tokens and an output price of $1.20 per 1 million tokens, teams can effectively manage their budget while leveraging the model for extensive projects. This pricing structure allows organizations to scale their usage according to specific needs, optimizing costs for high-volume tasks.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.01

Max Output

32,768

Completion tokens

Input Price / 1M

$0.0975

Prompt tokens

Output Price / 1M

$0.7800

Completion tokens

Top Benchmark

82.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 Next 80B A3B Thinking. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	75.9	#35 of 125	artificial_analysis
MMLU	82.4	#28 of 127	artificial_analysis

Price History

Qwen: Qwen3 Next 80B A3B Thinking Pricing Trend

Input / 1M tokens-35.0%Output / 1M tokens-35.0%

Current Input / 1M

$0.0975

Current Output / 1M

$0.7800

Performance History

Qwen: Qwen3 Next 80B A3B Thinking Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0975
Output (Completion)	$0.7800

Compare with Qwen: Qwen3.5-9B Compare with ByteDance Seed: Seed-2.0-Mini Compare with ByteDance: UI-TARS 7B

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$12

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$12 Free Models Router$0.00−$12 Google: Gemma 3 12B (free)$0.00−$12 Google: Gemma 3 27B (free)$0.00−$12

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 Next 80B A3B Thinking vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 Next 80B A3B Thinking vs Free Models Router Qwen: Qwen3 Next 80B A3B Thinking vs Google: Gemma 3 12B (free)Qwen: Qwen3 Next 80B A3B Thinking vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

75.9

#35 of 125

artificial_analysis

MMLU

82.4

#28 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0975

Output (Completion)

$0.7800

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Next 80B A3B Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$12

25M input + 12M output tokens