Model Cost Profile

DeepSeek: R1 Distill Qwen 32B

Developer: deepseek· Tokenizer: Qwen · Instruct: deepseek-r1 · Quantization: fp8

Canonical ID: deepseek/deepseek-r1-distill-qwen-32b

Pricing updated Apr 24, 2026

Input rank: #168Output rank: #79

Live Pricing

Input: $0.2900

Output: $0.2900

Visit DeepSeek ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

DeepSeek: R1 Distill Qwen 32B is designed for applications requiring extensive context management, offering a context window of 32,768 tokens, making it ideal for complex document analysis and long-form content generation. With a competitive pricing model of $0.29 per million tokens for both input and output, teams can efficiently manage costs while leveraging the model for tasks such as customer support automation and advanced data extraction. This API model is particularly beneficial for organizations that need to process large volumes of text without sacrificing performance or incurring high operational expenses.

📋 Structured Output🧠 Reasoning

Context Window

32,768

Input tokens

Full-context input ≈ $0.01

Max Output

32,768

Completion tokens

Input Price / 1M

$0.2900

Prompt tokens

Output Price / 1M

$0.2900

Completion tokens

Top Benchmark

73.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for DeepSeek: R1 Distill Qwen 32B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	61.5	#63 of 125	artificial_analysis
MMLU	73.9	#79 of 127	artificial_analysis

Price History

DeepSeek: R1 Distill Qwen 32B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.2900

Current Output / 1M

$0.2900

Performance History

DeepSeek: R1 Distill Qwen 32B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2900
Output (Completion)	$0.2900

Compare with DeepSeek: DeepSeek V3.2 Exp Compare with MiniMax: MiniMax M2.1 Compare with Baidu: ERNIE 4.5 300B A47B

Cost Calculator

Estimate monthly spend for DeepSeek: R1 Distill Qwen 32B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$11

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$11 Free Models Router$0.00−$11 Google: Gemma 3 12B (free)$0.00−$11 Google: Gemma 3 27B (free)$0.00−$11

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

DeepSeek: R1 Distill Qwen 32B vs Baidu: Qianfan-OCR-Fast (free)DeepSeek: R1 Distill Qwen 32B vs Free Models Router DeepSeek: R1 Distill Qwen 32B vs Google: Gemma 3 12B (free)DeepSeek: R1 Distill Qwen 32B vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

61.5

#63 of 125

artificial_analysis

MMLU

73.9

#79 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.2900

Output (Completion)

$0.2900

Cost Calculator

Estimate monthly spend for DeepSeek: R1 Distill Qwen 32B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$11

25M input + 12M output tokens