Model Cost Profile

Qwen: Qwen3 235B A22B Instruct 2507

Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3-235b-a22b-07-25

Pricing updated Apr 22, 2026

Input rank: #62Output rank: #39

Live Pricing

Input: $0.0710

Output: $0.1000

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 22, 2026 · MMLU score via public benchmark data

Qwen3 235B A22B Instruct 2507, developed by Qwen, is designed for complex tasks requiring extensive context, making it ideal for applications in natural language understanding, content generation, and conversational AI. With a context window of 262,144 tokens, teams can leverage this model for projects that demand deep contextual awareness, such as long-form content creation or detailed dialogue systems. The pricing structure, at $0.07 per 1M tokens for input and $0.10 per 1M tokens for output, allows organizations to budget effectively based on their usage patterns and expected workload.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

262,144

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.0710

Prompt tokens

Output Price / 1M

$0.1000

Completion tokens

Top Benchmark

82.8

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 235B A22B Instruct 2507. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	75.3	#35 of 125	artificial_analysis
MMLU	82.8	#25 of 127	artificial_analysis

Price History

Qwen: Qwen3 235B A22B Instruct 2507 Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0710

Current Output / 1M

$0.1000

Performance History

Qwen: Qwen3 235B A22B Instruct 2507 Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

97.9%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0710
Output (Completion)	$0.1000

Compare with Qwen: Qwen3 Coder 30B A3B Instruct Compare with Baidu: ERNIE 4.5 21B A3B Compare with Baidu: ERNIE 4.5 21B A3B Thinking

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 235B A22B Instruct 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$2.97

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$2.97 Free Models Router$0.00−$2.97 Google: Gemma 3 12B (free)$0.00−$2.97 Google: Gemma 3 27B (free)$0.00−$2.97

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 235B A22B Instruct 2507 vs Arcee AI: Trinity Large Preview (free)Qwen: Qwen3 235B A22B Instruct 2507 vs Free Models Router Qwen: Qwen3 235B A22B Instruct 2507 vs Google: Gemma 3 12B (free)Qwen: Qwen3 235B A22B Instruct 2507 vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

75.3

#35 of 125

artificial_analysis

MMLU

82.8

#25 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0710

Output (Completion)

$0.1000

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 235B A22B Instruct 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$2.97

25M input + 12M output tokens