Context Window
32,768
Input tokens
Full-context input ≈ $0.01
Model Cost Profile
Developer: deepseek· Tokenizer: Qwen · Instruct: deepseek-r1 · Quantization: fp8
Canonical ID: deepseek/deepseek-r1-distill-qwen-32b
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.2900
Output: $0.2900
Last synced Apr 24, 2026 · MMLU score via public benchmark data
DeepSeek: R1 Distill Qwen 32B is designed for applications requiring extensive context management, offering a context window of 32,768 tokens, making it ideal for complex document analysis and long-form content generation. With a competitive pricing model of $0.29 per million tokens for both input and output, teams can efficiently manage costs while leveraging the model for tasks such as customer support automation and advanced data extraction. This API model is particularly beneficial for organizations that need to process large volumes of text without sacrificing performance or incurring high operational expenses.
Context Window
32,768
Input tokens
Full-context input ≈ $0.01
Max Output
32,768
Completion tokens
Input Price / 1M
$0.2900
Prompt tokens
Output Price / 1M
$0.2900
Completion tokens
Top Benchmark
73.9
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for DeepSeek: R1 Distill Qwen 32B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.2900
Current Output / 1M
$0.2900
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2900 |
| Output (Completion) | $0.2900 |
Estimate monthly spend for DeepSeek: R1 Distill Qwen 32B based on your workload.
Estimated Monthly Cost
$11
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.