Model Cost Profile

Meta: Llama 3.1 70B Instruct

Developer: meta-llama· Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8

Canonical ID: meta-llama/llama-3.1-70b-instruct

Pricing updated Apr 25, 2026

Input rank: #191Output rank: #102

Live Pricing

Input: $0.4000

Output: $0.4000

HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026

Meta: Llama 3.1 70B Instruct is designed for complex instruction-following tasks, making it suitable for applications in customer support automation and content generation. With a context window of 131,072 tokens, this model can handle extensive dialogues and large documents, providing teams with the ability to maintain context over longer interactions. The pricing structure at $0.40 per million tokens for both input and output allows organizations to budget effectively while scaling their usage based on project demands.

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output

Context Window

131,072

Input tokens

Full-context input ≈ $0.05

Max Output

16,384

Completion tokens

Input Price / 1M

$0.4000

Prompt tokens

Output Price / 1M

$0.4000

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Meta: Llama 3.1 70B Instruct Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.4000

Current Output / 1M

$0.4000

Performance History

Meta: Llama 3.1 70B Instruct Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.4000
Output (Completion)	$0.4000

Compare with Llama Guard 3 8B Compare with DeepSeek: DeepSeek V3.2 Speciale Compare with MiniMax: MiniMax M1

Cost Calculator

Estimate monthly spend for Meta: Llama 3.1 70B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$15

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$15 Free Models Router$0.00−$15 Google: Gemma 3 12B (free)$0.00−$15 Google: Gemma 3 27B (free)$0.00−$15

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Meta: Llama 3.1 70B Instruct vs Baidu: Qianfan-OCR-Fast (free)Meta: Llama 3.1 70B Instruct vs Free Models Router Meta: Llama 3.1 70B Instruct vs Google: Gemma 3 12B (free)Meta: Llama 3.1 70B Instruct vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.4000

Output (Completion)

$0.4000

Cost Calculator

Estimate monthly spend for Meta: Llama 3.1 70B Instruct based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$15

25M input + 12M output tokens