Model Cost Profile

Qwen: Qwen3.5 397B A17B

Developer: qwen· Tokenizer: Qwen3 · Quantization: fp8

Canonical ID: qwen/qwen3.5-397b-a17b-20260216

Pricing updated Apr 25, 2026

Input rank: #188Output rank: #237

Live Pricing

Input: $0.3900

Output: $2.34

Visit Qwen ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026

Qwen3.5 397B A17B is designed for applications requiring extensive context handling, with a remarkable context window of 262,144 tokens, making it suitable for complex tasks such as document summarization and long-form content generation. Teams utilizing this API model can expect input costs of $0.15 per million tokens and output costs of $1.00 per million tokens, allowing for scalable budgeting based on usage patterns. This pricing structure is particularly advantageous for enterprises that need to process large volumes of text while maintaining high-quality output.

💡 Enable prompt caching to save 50% on repeated input tokens ($0.1950/M cached vs $0.3900/M standard).

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

262,144

Input tokens

Full-context input ≈ $0.10

Max Output

65,536

Completion tokens

Input Price / 1M

$0.3900

Prompt tokens

Output Price / 1M

$2.34

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Qwen: Qwen3.5 397B A17B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.3900

Current Output / 1M

$2.34

Performance History

Qwen: Qwen3.5 397B A17B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.9%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.3900
Output (Completion)	$2.34
Cache Read	$0.1950

Compare with Qwen: Qwen3 235B A22B Compare with Z.ai: GLM 4.6 Compare with DeepSeek: DeepSeek V3.2 Speciale

Cost Calculator

Estimate monthly spend for Qwen: Qwen3.5 397B A17B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$38 Free Models Router$0.00−$38 Google: Gemma 3 12B (free)$0.00−$38 Google: Gemma 3 27B (free)$0.00−$38

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Qwen: Qwen3.5 397B A17B vs Baidu: Qianfan-OCR-Fast (free)Qwen: Qwen3.5 397B A17B vs Free Models Router Qwen: Qwen3.5 397B A17B vs Google: Gemma 3 12B (free)Qwen: Qwen3.5 397B A17B vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.3900

Output (Completion)

$2.34

Cache Read

$0.1950

Cost Calculator

Estimate monthly spend for Qwen: Qwen3.5 397B A17B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens