Model Cost Profile

Qwen2.5 Coder 32B Instruct

Developer: qwen· Tokenizer: Qwen · Instruct: chatml · Quantization: unknown

Canonical ID: qwen/qwen-2.5-coder-32b-instruct

Pricing updated Apr 25, 2026

Input rank: #228Output rank: #174

Live Pricing

Input: $0.6600

Output: $1.00

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026 · MMLU score via public benchmark data

Qwen2.5 Coder 32B Instruct is designed for teams requiring advanced coding assistance, capable of handling complex programming tasks with a context window of 32,768 tokens. This model's pricing structure, at $0.20 per million tokens for both input and output, makes it a cost-effective choice for projects with high token usage. Ideal for software development, debugging, and code generation, Qwen2.5 Coder can enhance productivity while managing budget constraints effectively.

Context Window

32,768

Input tokens

Full-context input ≈ $0.02

Max Output

—

Not specified

Input Price / 1M

$0.6600

Prompt tokens

Output Price / 1M

$1.00

Completion tokens

Top Benchmark

63.5

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Qwen2.5 Coder 32B Instruct. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	41.7	#111 of 125	artificial_analysis
MMLU	63.5	#112 of 127	artificial_analysis

Price History

Qwen2.5 Coder 32B Instruct Pricing Trend

Input / 1M tokens+230.0%Output / 1M tokens+400.0%

Current Input / 1M

$0.6600

Current Output / 1M

$1.00

Performance History

Qwen2.5 Coder 32B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.6600
Output (Completion)	$1.00

Compare with Qwen: Qwen3 Coder Plus Compare with Google: Gemma 2 27B Compare with Sao10K: Llama 3.3 Euryale 70B

Cost Calculator

Estimate monthly spend for Qwen2.5 Coder 32B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$29

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$29 Free Models Router$0.00−$29 Google: Gemma 3 12B (free)$0.00−$29 Google: Gemma 3 27B (free)$0.00−$29

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen2.5 Coder 32B Instruct vs Baidu: Qianfan-OCR-Fast (free)Qwen2.5 Coder 32B Instruct vs Free Models Router Qwen2.5 Coder 32B Instruct vs Google: Gemma 3 12B (free)Qwen2.5 Coder 32B Instruct vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

41.7

#111 of 125

artificial_analysis

MMLU

63.5

#112 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.6600

Output (Completion)

$1.00

Cost Calculator

Estimate monthly spend for Qwen2.5 Coder 32B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$29

25M input + 12M output tokens