Context Window: 131,000 tokens
Model Cost Profile
Developer: meta-llama
Live Pricing (updated Mar 11, 2026): Input $4.00 / 1M tokens · Output $4.00 / 1M tokens
Pricing via OpenRouter API · Last synced Mar 11, 2026 · MMLU score via public benchmark data
Meta: Llama 3.1 405B Instruct offers a 131,000-token context window, making it suitable for long-context applications such as long-form content generation and detailed conversational agents. Both input and output tokens are priced at $4.00 per million, which keeps cost projections simple for high-volume API usage. The model suits organizations that need extensive context handling alongside predictable budgeting.
| Metric | Value | Notes |
|---|---|---|
| Context Window | 131,000 tokens | |
| Input Price / 1M | $4.00 | Prompt tokens |
| Output Price / 1M | $4.00 | Completion tokens |
| Intelligence (MMLU) | 73.2 | Massive Multitask Language Understanding |
Standardized evaluation scores for Meta: Llama 3.1 405B Instruct.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $4.00 |
| Output (Completion) | $4.00 |
Price History
Current Input / 1M: $4.00 · Current Output / 1M: $4.00
Estimate monthly spend for Meta: Llama 3.1 405B Instruct based on your workload.
Estimated Monthly Cost: $148 (25M input + 12M output tokens)
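The $148 figure follows directly from the linear per-token pricing on this page. A minimal sketch of that arithmetic (prices and the example workload are taken from this page; the function name is illustrative):

```python
def monthly_cost(input_mtok: float, output_mtok: float,
                 input_price: float = 4.00, output_price: float = 4.00) -> float:
    """Estimate monthly spend in USD.

    input_mtok / output_mtok: monthly token volumes in millions.
    input_price / output_price: USD per 1M tokens ($4.00 each for this model).
    """
    return input_mtok * input_price + output_mtok * output_price

# Workload from this page: 25M input + 12M output tokens per month.
print(monthly_cost(25, 12))  # 148.0
```

Because both rates are flat per million tokens, scaling the workload scales the bill proportionally; there are no tiers to account for.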
Quick links for comparing costs before a production rollout.
Common pricing and benchmark questions for Meta: Llama 3.1 405B Instruct.
Meta: Llama 3.1 405B Instruct input pricing is $4.00 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 405B Instruct output pricing is $4.00 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 405B Instruct supports a context window of 131,000 tokens.
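The 131,000-token window bounds prompt plus completion together, so requests should budget for both. A rough pre-flight check is sketched below; the 4-characters-per-token heuristic is an assumption for illustration only, since exact counts require the model's actual tokenizer:

```python
CONTEXT_WINDOW = 131_000  # tokens, per this page

def fits_context(prompt: str, max_output_tokens: int,
                 chars_per_token: float = 4.0) -> bool:
    """Rough check that a prompt plus its output budget fits the context window.

    Estimates prompt tokens with a chars/4 heuristic (an assumption; a real
    check should count tokens with the model's tokenizer).
    """
    est_prompt_tokens = len(prompt) / chars_per_token
    return est_prompt_tokens + max_output_tokens <= CONTEXT_WINDOW

print(fits_context("hello " * 1000, max_output_tokens=4096))  # True
```

In practice, leaving generous headroom under the estimate avoids truncation when the heuristic undercounts tokens.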
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.