Model Cost Profile
Developer: qwen · Tokenizer: Qwen3 · Quantization: fp8
Canonical ID: qwen/qwen3.5-397b-a17b-20260216
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.3900
Output: $2.34
Last synced Apr 22, 2026
Qwen3.5 397B A17B targets applications that need long-context handling. Its 262,144-token context window suits tasks such as document summarization and long-form content generation, and a single completion can run to 65,536 tokens. Input is billed at $0.39 per million tokens and output at $2.34 per million tokens, so spend scales linearly with usage. Cached input reads are billed at $0.195 per million tokens, half the standard input rate, which lowers the cost of workloads that repeatedly resend the same large context.
Context Window
262,144
Input tokens
Full-context input ≈ $0.10
Max Output
65,536
Completion tokens
Input Price / 1M
$0.3900
Prompt tokens
Output Price / 1M
$2.34
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.3900
Current Output / 1M
$2.34
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.4%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.3900 |
| Output (Completion) | $2.34 |
| Cache Read | $0.1950 |
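The per-request arithmetic implied by this table can be sketched in a few lines of Python. This is a minimal illustration, not an official billing formula: the function name `request_cost` and its parameters are hypothetical, and it assumes that cached tokens are a subset of the input tokens and are billed at the cache-read rate instead of the input rate.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": Decimal("0.39"),
    "output": Decimal("2.34"),
    "cache_read": Decimal("0.195"),
}
MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request.

    Assumption (not confirmed by the source): cached_tokens is the part of
    input_tokens served from cache, billed at the cache-read rate instead
    of the standard input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / MILLION


# A prompt that fills the whole 262,144-token context window, no output.
full_context = request_cost(262_144, 0)
```

Under these assumptions, `request_cost(262_144, 0)` comes to about $0.102, which matches the "Full-context input ≈ $0.10" figure, and the same prompt served entirely from cache costs about half that.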
Estimate monthly spend for Qwen: Qwen3.5 397B A17B based on your workload.
Estimated Monthly Cost
$38
25M input + 12M output tokens
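The $38 estimate can be reproduced from the listed prices. The sketch below assumes no cache reads and uses a hypothetical helper named `monthly_cost`; token volumes are given in millions.

```python
INPUT_PRICE = 0.39   # USD per 1M input tokens
OUTPUT_PRICE = 2.34  # USD per 1M output tokens


def monthly_cost(input_millions: float, output_millions: float) -> float:
    """Return the monthly USD spend for the given token volumes (in millions)."""
    return input_millions * INPUT_PRICE + output_millions * OUTPUT_PRICE


# 25M input + 12M output tokens: 9.75 + 28.08 = 37.83, shown as $38 when rounded.
estimate = monthly_cost(25, 12)
print(f"${estimate:.2f}")
```

Output tokens account for roughly three quarters of this estimate despite being the smaller volume, so trimming completion length is likely the more effective lever for reducing spend.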