Model Cost Profile
Developer: deepseek · Tokenizer: Llama3 · Instruct: deepseek-r1 · Quantization: fp8
Canonical ID: deepseek/deepseek-r1-distill-llama-70b
Pricing updated Apr 24, 2026
Live Pricing: Input $0.7000 · Output $0.8000 (per 1M tokens)
Last synced Apr 24, 2026 · MMLU score via public benchmark data
DeepSeek's R1 Distill Llama 70B offers a 131,072-token context window, making it well suited to applications that require in-depth analysis, such as legal document review or comprehensive research tasks. At $0.70 per million input tokens and $0.80 per million output tokens, teams can keep costs manageable while processing data at scale. The model is particularly useful for organizations that need to handle complex queries and generate detailed responses in real time.
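As a quick sanity check on the figures quoted on this page, a minimal cost sketch (rates hard-coded from the listing above, not fetched from any live API):

```python
# Per-request cost for DeepSeek R1 Distill Llama 70B,
# using the rates listed on this page (USD per 1M tokens).
INPUT_PER_M = 0.70
OUTPUT_PER_M = 0.80

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a single request."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# A prompt that fills the full 131,072-token context window:
print(f"${request_cost(131_072, 0):.4f}")  # ≈ $0.0918, i.e. the "≈ $0.09" shown above
```

This reproduces the "full-context input ≈ $0.09" figure: 131,072 × $0.70 / 1,000,000 ≈ $0.0918.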
| Metric | Value | Notes |
|---|---|---|
| Context Window | 131,072 | Input tokens; full-context input ≈ $0.09 |
| Max Output | 16,384 | Completion tokens |
| Input Price / 1M | $0.7000 | Prompt tokens |
| Output Price / 1M | $0.8000 | Completion tokens |
| Top Benchmark | 79.5 | MMLU score — highest of MMLU, GPQA, MATH, HumanEval |
Evaluation scores for DeepSeek: R1 Distill Llama 70B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History

| Metric | Value |
|---|---|
| Current Input / 1M | $0.7000 |
| Current Output / 1M | $0.8000 |

Performance History

| Metric | Value |
|---|---|
| Current TPS | 0.00 |
| Current Latency | 0ms |
| Uptime | 100.0% |
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.7000 |
| Output (Completion) | $0.8000 |
Estimate monthly spend for DeepSeek: R1 Distill Llama 70B based on your workload.
Estimated Monthly Cost
$27
25M input + 12M output tokens
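The $27 figure above can be reproduced with a small helper (a sketch with the page's rates as defaults, not an official calculator):

```python
# Monthly spend estimate for the workload quoted above:
# 25M input + 12M output tokens at $0.70 / $0.80 per 1M.
def monthly_cost(input_m: float, output_m: float,
                 in_rate: float = 0.70, out_rate: float = 0.80) -> float:
    """Millions of tokens per month -> USD."""
    return input_m * in_rate + output_m * out_rate

print(round(monthly_cost(25, 12), 2))  # 17.50 + 9.60 = 27.10, shown above rounded to $27
```

Swapping in your own monthly token volumes gives a first-order budget estimate before production rollout.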