Model Cost Profile

Qwen2.5 72B Instruct

Developer: qwen· Tokenizer: Qwen · Instruct: chatml · Quantization: fp8

Canonical ID: qwen/qwen-2.5-72b-instruct

Pricing updated Apr 25, 2026

Input rank: #101Output rank: #97

Live Pricing

Input: $0.1200

Output: $0.3900

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026

Qwen2.5 72B Instruct is designed for advanced natural language processing tasks, making it suitable for applications such as chatbots, content generation, and data analysis. With a context window of 32,768 tokens, this model excels in handling extensive dialogues and complex queries, allowing teams to maintain context over longer interactions. Pricing for the API is competitive, with an input cost of $0.12 per million tokens and an output cost of $0.39 per million tokens, making it a cost-effective choice for organizations requiring scalable language solutions.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

32,768

Input tokens

Full-context input ≈ $0.00

Max Output

16,384

Completion tokens

Input Price / 1M

$0.1200

Prompt tokens

Output Price / 1M

$0.3900

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Qwen2.5 72B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.1200

Current Output / 1M

$0.3900

Performance History

Qwen2.5 72B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1200
Output (Completion)	$0.3900

Compare with Qwen: Qwen3 VL 8B Thinking Compare with Mistral: Mistral 7B Instruct v0.1 Compare with Google: Gemma 4 31B

Cost Calculator

Estimate monthly spend for Qwen2.5 72B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$7.68

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$7.68 Free Models Router$0.00−$7.68 Google: Gemma 3 12B (free)$0.00−$7.68 Google: Gemma 3 27B (free)$0.00−$7.68

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen2.5 72B Instruct vs Baidu: Qianfan-OCR-Fast (free)Qwen2.5 72B Instruct vs Free Models Router Qwen2.5 72B Instruct vs Google: Gemma 3 12B (free)Qwen2.5 72B Instruct vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1200

Output (Completion)

$0.3900

Cost Calculator

Estimate monthly spend for Qwen2.5 72B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$7.68

25M input + 12M output tokens