Context Window
1,000,000
Input tokens
Full-context input โ $0.20
Model Cost Profile
Developer: qwenยท Tokenizer: Qwen3 ยท Quantization: unknown
Canonical ID: qwen/qwen3-coder-flash
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.1950
Output: $0.9750
Last synced Apr 24, 2026
Qwen: Qwen3 Coder Flash is designed for developers needing extensive context in applications like code generation and natural language processing, with a remarkable context window of 1,000,000 tokens. This model's pricing structure, at $0.30 per million input tokens and $1.50 per million output tokens, allows teams to manage costs effectively while leveraging its capabilities for large-scale projects. Ideal for enterprise-level solutions, Qwen3 Coder Flash enhances productivity in software development and data analysis by accommodating complex queries and extensive data sets.
Context Window
1,000,000
Input tokens
Full-context input โ $0.20
Max Output
65,536
Completion tokens
Input Price / 1M
$0.1950
Prompt tokens
Output Price / 1M
$0.9750
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.1950
Current Output / 1M
$0.9750
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1950 |
| Output (Completion) | $0.9750 |
| Cache Read | $0.0390 |
| Cache Write | $0.24375 |
Estimate monthly spend for Qwen: Qwen3 Coder Flash based on your workload.
Estimated Monthly Cost
$17
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.