Context Window
32,768
Input tokens
Full-context input ≈ $0.02
Model Cost Profile
Developer: qwen · Tokenizer: Qwen · Instruct: chatml · Quantization: unknown
Canonical ID: qwen/qwen-2.5-coder-32b-instruct
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.6600
Output: $1.00
Last synced Apr 25, 2026 · MMLU score via public benchmark data
Qwen2.5 Coder 32B Instruct is built for teams that need advanced coding assistance, and it handles complex programming tasks within a 32,768-token context window. At $0.66 per million input tokens and $1.00 per million output tokens, it is a cost-effective choice for projects with high token usage. It suits software development, debugging, and code generation workloads where budget matters.
Max Output
Not specified
Input Price / 1M
$0.6600
Prompt tokens
Output Price / 1M
$1.00
Completion tokens
Top Benchmark
63.5
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Qwen2.5 Coder 32B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.6600
Current Output / 1M
$1.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.6600 |
| Output (Completion) | $1.00 |
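The per-request arithmetic behind these rates can be sketched as follows. This is a minimal illustration, assuming the listed prices apply linearly per token with no minimum charge or tiering; the function name `request_cost` is hypothetical and not part of any provider API.

```python
# Prices per 1M tokens in USD, copied from the pricing table.
INPUT_PRICE_PER_M = 0.66
OUTPUT_PRICE_PER_M = 1.00
CONTEXT_WINDOW = 32_768  # tokens


def request_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the USD cost of one request (hypothetical helper).

    Assumes simple linear per-token billing at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Cost of filling the whole context window with prompt tokens.
full_context = request_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context:.4f}")
```

Under these assumptions a full 32,768-token prompt costs about two cents, which matches the "Full-context input ≈ $0.02" figure on this page.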
Estimate monthly spend for Qwen2.5 Coder 32B Instruct based on your workload.
Estimated Monthly Cost
$29
25M input + 12M output tokens
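The estimate above can be reproduced with a short calculation. This is a sketch only: the helper `monthly_cost` is hypothetical, and rounding half-up to whole dollars is an assumption about how the page displays the figure.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices per 1M tokens in USD, as listed on this page.
INPUT_PRICE_PER_M = Decimal("0.66")
OUTPUT_PRICE_PER_M = Decimal("1.00")


def monthly_cost(input_millions: int, output_millions: int) -> Decimal:
    """Return the monthly USD spend for a workload given in millions of tokens."""
    return input_millions * INPUT_PRICE_PER_M + output_millions * OUTPUT_PRICE_PER_M


# Workload from the estimate: 25M input + 12M output tokens per month.
estimate = monthly_cost(25, 12)

# Assumed display rule: round half-up to the nearest whole dollar.
rounded = estimate.quantize(Decimal("1"), rounding=ROUND_HALF_UP)
print(f"Exact: ${estimate}, displayed: ${rounded}")
```

`Decimal` is used instead of floats so the half-dollar boundary rounds predictably; Python's built-in `round` on a float would apply banker's rounding at exactly 28.5.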