Model Cost Profile

Qwen: Qwen2.5 7B Instruct

Developer: qwen · Tokenizer: Qwen · Instruct: chatml · Quantization: unknown

Canonical ID: qwen/qwen-2.5-7b-instruct

Pricing updated Apr 24, 2026

Input rank: #45 · Output rank: #40

Live Pricing

Input: $0.0400 / 1M tokens

Output: $0.1000 / 1M tokens


Last synced Apr 24, 2026

Qwen2.5 7B Instruct by qwen offers a 32,768-token context window, which suits tasks such as document summarization and conversational AI. Through this API, input costs $0.04 per million tokens and output costs $0.10 per million tokens, so output tokens cost 2.5 times as much as input tokens and total spend depends on the mix of prompt and completion volume. At these rates the model is a low-cost option for teams that process large volumes of text.

🔧 Tool Calling · 🔌 MCP Compatible · 📋 Structured Output

Context Window

32,768

Input tokens

Full-context input ≈ $0.0013 (32,768 tokens at $0.04 per 1M)

Max Output

32,768

Completion tokens

Input Price / 1M

$0.0400

Prompt tokens

Output Price / 1M

$0.1000

Completion tokens
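The per-request cost implied by the figures above can be sketched as follows. This is a minimal illustration, not part of any official SDK: the function name `request_cost` and the constants are hypothetical, and it assumes the listed context window and max output are checked independently of each other, which the page does not confirm.

```python
# Listed prices in USD per 1M tokens, and the listed token limits.
INPUT_PRICE_PER_M = 0.04
OUTPUT_PRICE_PER_M = 0.10
CONTEXT_WINDOW = 32_768  # listed input-token limit
MAX_OUTPUT = 32_768      # listed completion-token limit


def request_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: each limit is enforced on its own; a provider may
    # instead cap prompt + completion together.
    if prompt_tokens > CONTEXT_WINDOW:
        raise ValueError("prompt exceeds the listed context window")
    if completion_tokens > MAX_OUTPUT:
        raise ValueError("completion exceeds the listed max output")
    total = (prompt_tokens * INPUT_PRICE_PER_M
             + completion_tokens * OUTPUT_PRICE_PER_M)
    return total / 1_000_000


# A prompt that fills the whole context window, with no completion.
full_context = request_cost(CONTEXT_WINDOW, 0)
print(f"${full_context:.4f}")
```

A full-context prompt costs a fraction of a cent, which is why a two-decimal display rounds it to $0.00.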

Top Benchmark

Pending

No benchmark data yet

Price History

Qwen: Qwen2.5 7B Instruct Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.0400

Current Output / 1M

$0.1000

Performance History

Qwen: Qwen2.5 7B Instruct Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.9%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0400
Output (Completion)	$0.1000


Cost Calculator

Estimate monthly spend for Qwen: Qwen2.5 7B Instruct based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$2.20

25M input + 12M output tokens
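The estimate above follows directly from the listed per-million prices. A minimal sketch of the same arithmetic, assuming simple linear pricing with no volume discounts or minimum charges (the function name `monthly_cost` is illustrative only):

```python
# Listed prices in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.04
OUTPUT_PRICE_PER_M = 0.10


def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate monthly USD spend from token volumes (hypothetical helper)."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# The workload shown in the calculator: 25M input + 12M output tokens.
estimate = monthly_cost(25_000_000, 12_000_000)
print(f"${estimate:.2f}")  # prints $2.20
```

Here the input side contributes $1.00 and the output side $1.20, so output accounts for more than half the bill even though it is less than half the token volume.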

Same Workload on Other Models

Model	Estimated Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$2.20
Free Models Router	$0.00	−$2.20
Google: Gemma 3 12B (free)	$0.00	−$2.20
Google: Gemma 3 27B (free)	$0.00	−$2.20

