Model Cost Profile

Qwen: Qwen3 235B A22B Thinking 2507

Developer: qwen· Tokenizer: Qwen3 · Instruct: qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-235b-a22b-thinking-2507

Pricing updated Apr 25, 2026

Input rank: #114Output rank: #198

Live Pricing

Input: $0.1495

Output: $1.50

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026 · MMLU score via public benchmark data

Qwen3 235B A22B Thinking 2507, developed by qwen, offers a substantial context window of 131,072 tokens, making it suitable for complex applications such as long-form content generation and large-scale data analysis. With a pricing structure that charges $0.00 per million tokens for both input and output, this model is particularly advantageous for teams looking to scale their operations without incurring significant costs. Its unique capabilities allow for effective handling of extensive datasets, making it ideal for industries requiring deep insights from large volumes of text.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.1495

Prompt tokens

Output Price / 1M

$1.50

Completion tokens

Top Benchmark

84.3

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen: Qwen3 235B A22B Thinking 2507. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	79.0	#19 of 125	artificial_analysis
MMLU	84.3	#14 of 127	artificial_analysis

Price History

Qwen: Qwen3 235B A22B Thinking 2507 Pricing Trend

Input / 1M tokens+35.9%Output / 1M tokens+149.2%

Current Input / 1M

$0.1495

Current Output / 1M

$1.50

Performance History

Qwen: Qwen3 235B A22B Thinking 2507 Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1495
Output (Completion)	$1.50

Compare with Qwen: QwQ 32B Compare with AllenAI: Olmo 3 32B Think Compare with Arcee AI: Trinity Large Preview

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 235B A22B Thinking 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$22

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$22 Free Models Router$0.00−$22 Google: Gemma 3 12B (free)$0.00−$22 Google: Gemma 3 27B (free)$0.00−$22

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 235B A22B Thinking 2507 vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 235B A22B Thinking 2507 vs Free Models Router Qwen: Qwen3 235B A22B Thinking 2507 vs Google: Gemma 3 12B (free)Qwen: Qwen3 235B A22B Thinking 2507 vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

79.0

#19 of 125

artificial_analysis

MMLU

84.3

#14 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1495

Output (Completion)

$1.50

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 235B A22B Thinking 2507 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$22

25M input + 12M output tokens