Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8
Canonical ID: qwen/qwen3-235b-a22b-07-25
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.0710
Output: $0.1000
Last synced Apr 22, 2026 · MMLU score via public benchmark data
Qwen3 235B A22B Instruct 2507, developed by Qwen, is designed for complex tasks requiring extensive context, making it ideal for applications in natural language understanding, content generation, and conversational AI. With a context window of 262,144 tokens, teams can leverage this model for projects that demand deep contextual awareness, such as long-form content creation or detailed dialogue systems. The pricing structure, at $0.07 per 1M tokens for input and $0.10 per 1M tokens for output, allows organizations to budget effectively based on their usage patterns and expected workload.
Context Window
262,144
Input tokens
Full-context input ≈ $0.02
Max Output
—
Not specified
Input Price / 1M
$0.0710
Prompt tokens
Output Price / 1M
$0.1000
Completion tokens
Top Benchmark
82.8
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen: Qwen3 235B A22B Instruct 2507. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0710
Current Output / 1M
$0.1000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
97.9%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0710 |
| Output (Completion) | $0.1000 |
Estimate monthly spend for Qwen: Qwen3 235B A22B Instruct 2507 based on your workload.
Estimated Monthly Cost
$2.97
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.