Model Cost Profile

Qwen: Qwen-Plus

Developer: qwen

Pricing updated Mar 11, 2026

Input rank: #195 · Output rank: #189

Live Pricing

Input: $0.4000

Output: $1.20

Pricing via OpenRouter API · Last synced Mar 11, 2026

Qwen-Plus, developed by qwen, offers an extensive context window of 1,000,000 tokens, making it suitable for applications that require deep contextual understanding, such as long-form content generation and complex data analysis. Teams using this model via the API can expect input costs of $0.40 per million tokens and output costs of $1.20 per million tokens, allowing for scalable budgeting based on usage patterns. This pricing structure lets organizations manage expenses effectively while leveraging the model's capabilities for tasks like conversational AI and long-document summarization.
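To make the budgeting concrete, a minimal sketch of a per-request cost estimate at the listed rates ($0.40 input / $1.20 output per 1M tokens). The function name and the example token counts are illustrative, not part of any official SDK:

```python
# Listed Qwen-Plus rates in USD per 1M tokens (from this page).
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 1.20

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a single request at the listed rates."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# Example: a long-context request with 200k input tokens and 2k output tokens.
print(round(request_cost(200_000, 2_000), 4))  # → 0.0824
```

Note that output tokens cost 3x input tokens here, so workloads that generate long completions (e.g. long-form drafting) scale cost faster than retrieval-heavy workloads with short answers.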

🔧 Tool Calling · 📋 Structured Output

Context Window

1,000,000

Tokens

Input Price / 1M

$0.4000

Prompt tokens

Output Price / 1M

$1.20

Completion tokens

Intelligence (MMLU)

Benchmark Pending

Massive Multitask Language Understanding

Price History

Qwen: Qwen-Plus Pricing Trend

Input / 1M tokens: 0.0% change · Output / 1M tokens: 0.0% change (Mar 7 – Mar 11)

[Price trend chart: input held at $0.4000 and output at $1.20 from Mar 7 through Mar 11]

Current Input / 1M

$0.4000

Current Output / 1M

$1.20

Cheaper Alternatives to Compare

Quick links for cost-reduction decisions before production rollout.

FAQ

Common pricing and benchmark questions for Qwen: Qwen-Plus.

How much does Qwen: Qwen-Plus cost per 1M input tokens?

Qwen: Qwen-Plus input pricing is $0.4000 per 1M tokens based on the latest synced provider data.

How much does Qwen: Qwen-Plus cost per 1M output tokens?

Qwen: Qwen-Plus output pricing is $1.20 per 1M tokens based on the latest synced provider data.

What context window does Qwen: Qwen-Plus support?

Qwen: Qwen-Plus supports a context window of 1,000,000 tokens.

How can I compare Qwen: Qwen-Plus with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
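The monthly spend projection mentioned above can be sketched as a simple calculation. The workload profile (requests per day, tokens per request) and the cheaper alternative's rates below are placeholders for illustration, not real pricing data:

```python
def monthly_spend(requests_per_day: int, in_tok: int, out_tok: int,
                  in_price: float, out_price: float, days: int = 30) -> float:
    """Project monthly USD spend from per-request token counts
    and per-1M-token prices."""
    total_in = requests_per_day * in_tok * days
    total_out = requests_per_day * out_tok * days
    return (total_in * in_price + total_out * out_price) / 1_000_000

# Assumed workload: 1,000 requests/day, 3k input + 500 output tokens each.
qwen_plus = monthly_spend(1_000, 3_000, 500, 0.40, 1.20)  # listed rates
cheaper = monthly_spend(1_000, 3_000, 500, 0.20, 0.60)    # hypothetical alternative

print(f"Qwen-Plus: ${qwen_plus:.2f}/mo vs alternative: ${cheaper:.2f}/mo")
# → Qwen-Plus: $54.00/mo vs alternative: $27.00/mo
```

Running the same workload profile through each candidate's rates gives a like-for-like monthly figure, which is usually a better basis for comparison than per-token prices alone.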