Model Cost Profile

Qwen: Qwen3 Max Thinking

Developer: qwen· Tokenizer: Qwen · Quantization: unknown

Canonical ID: qwen/qwen3-max-thinking-20260123

Pricing updated Apr 24, 2026

Input rank: #236Output rank: #257

Live Pricing

Input: $0.7800

Output: $3.90

Visit Qwen ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Qwen3 Max Thinking, developed by Qwen, offers an extensive context window of 262,144 tokens, making it ideal for applications requiring deep contextual understanding, such as legal document analysis or complex data summarization. With an input price of $1.20 per million tokens and an output price of $6.00 per million tokens, teams can effectively manage costs based on their specific usage patterns and project requirements. This model's high token capacity allows for more comprehensive interactions, which is beneficial for industries such as finance and research where detailed insights are crucial.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

262,144

Input tokens

Full-context input ≈ $0.20

Max Output

32,768

Completion tokens

Input Price / 1M

$0.7800

Prompt tokens

Output Price / 1M

$3.90

Completion tokens

Top Benchmark

82.4

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 Max Thinking. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	77.6	#25 of 125	artificial_analysis
MMLU	82.4	#26 of 127	artificial_analysis

Price History

Qwen: Qwen3 Max Thinking Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.7800

Current Output / 1M

$3.90

Performance History

Qwen: Qwen3 Max Thinking Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.7800
Output (Completion)	$3.90

Compare with Qwen: Qwen3 Max Compare with AionLabs: Aion-2.0 Compare with AionLabs: Aion-RP 1.0 (8B)

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Max Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$66

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$66 Free Models Router$0.00−$66 Google: Gemma 3 12B (free)$0.00−$66 Google: Gemma 3 27B (free)$0.00−$66

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 Max Thinking vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 Max Thinking vs Free Models Router Qwen: Qwen3 Max Thinking vs Google: Gemma 3 12B (free)Qwen: Qwen3 Max Thinking vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

77.6

#25 of 125

artificial_analysis

MMLU

82.4

#26 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.7800

Output (Completion)

$3.90

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Max Thinking based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$66

25M input + 12M output tokens