Model Cost Profile

Qwen: Qwen3 Max

Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-max

Pricing updated Apr 25, 2026

Input rank: #236Output rank: #258

Live Pricing

Input: $0.7800

Output: $3.90

Visit Qwen ↗View full pricing leaderboard

Last synced Apr 25, 2026 · MMLU score via public benchmark data

Qwen3 Max by qwen offers an extensive context window of 262,144 tokens, making it ideal for applications requiring deep contextual understanding such as long-form content generation and complex conversational agents. With an input price of $1.20 per million tokens and an output price of $6.00 per million tokens, teams can effectively manage costs while leveraging this model for high-volume tasks. This pricing structure allows organizations to optimize their budget for projects that demand both extensive input and detailed output, ensuring efficient resource allocation.

💡 Enable prompt caching to save 80% on repeated input tokens ($0.1560/M cached vs $0.7800/M standard).

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output💾 Implicit Caching

Context Window

262,144

Input tokens

Full-context input ≈ $0.20

Max Output

32,768

Completion tokens

Input Price / 1M

$0.7800

Prompt tokens

Output Price / 1M

$3.90

Completion tokens

Top Benchmark

76.2

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 Max. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	58.7	#72 of 125	artificial_analysis
MMLU	76.2	#64 of 127	artificial_analysis

Price History

Qwen: Qwen3 Max Pricing Trend

Input / 1M tokens-35.0%Output / 1M tokens-35.0%

Current Input / 1M

$0.7800

Current Output / 1M

$3.90

Performance History

Qwen: Qwen3 Max Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.7800
Output (Completion)	$3.90
Cache Read	$0.1560
Cache Write	$0.9750

Compare with Qwen: Qwen3 Max Thinking Compare with AionLabs: Aion-2.0 Compare with AionLabs: Aion-RP 1.0 (8B)

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Max based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$66

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$66 Free Models Router$0.00−$66 Google: Gemma 3 12B (free)$0.00−$66 Google: Gemma 3 27B (free)$0.00−$66

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 Max vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 Max vs Free Models Router Qwen: Qwen3 Max vs Google: Gemma 3 12B (free)Qwen: Qwen3 Max vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

58.7

#72 of 125

artificial_analysis

MMLU

76.2

#64 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.7800

Output (Completion)

$3.90

Cache Read

$0.1560

Cache Write

$0.9750

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Max based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$66

25M input + 12M output tokens