Model Cost Profile

Qwen: Qwen3 Coder Flash

Developer: qwen· Tokenizer: Qwen3 · Quantization: unknown

Canonical ID: qwen/qwen3-coder-flash

Pricing updated Apr 25, 2026

Input rank: #133Output rank: #168

Live Pricing

Input: $0.1950

Output: $0.9750

Visit Qwen ↗View full pricing leaderboard

Last synced Apr 25, 2026

Qwen: Qwen3 Coder Flash is designed for developers needing extensive context in applications like code generation and natural language processing, with a remarkable context window of 1,000,000 tokens. This model's pricing structure, at $0.30 per million input tokens and $1.50 per million output tokens, allows teams to manage costs effectively while leveraging its capabilities for large-scale projects. Ideal for enterprise-level solutions, Qwen3 Coder Flash enhances productivity in software development and data analysis by accommodating complex queries and extensive data sets.

💡 Enable prompt caching to save 80% on repeated input tokens ($0.0390/M cached vs $0.1950/M standard).

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

1,000,000

Input tokens

Full-context input ≈ $0.20

Max Output

65,536

Completion tokens

Input Price / 1M

$0.1950

Prompt tokens

Output Price / 1M

$0.9750

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Qwen: Qwen3 Coder Flash Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.1950

Current Output / 1M

$0.9750

Performance History

Qwen: Qwen3 Coder Flash Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1950
Output (Completion)	$0.9750
Cache Read	$0.0390
Cache Write	$0.24375

Compare with Qwen: Qwen3.5-27B Compare with AllenAI: Olmo 3.1 32B Instruct Compare with DeepSeek: DeepSeek V3 0324

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Coder Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$17

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$17 Free Models Router$0.00−$17 Google: Gemma 3 12B (free)$0.00−$17 Google: Gemma 3 27B (free)$0.00−$17

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3 Coder Flash vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3 Coder Flash vs Free Models Router Qwen: Qwen3 Coder Flash vs Google: Gemma 3 12B (free)Qwen: Qwen3 Coder Flash vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.1950

Output (Completion)

$0.9750

Cache Read

$0.0390

Cache Write

$0.24375

Cost Calculator

Estimate monthly spend for Qwen: Qwen3 Coder Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$17

25M input + 12M output tokens