Context Window
32,768
Tokens
Model Cost Profile
Developer: meta-llama
Pricing updated Mar 11, 2026
Meta: Llama 3.1 405B (base) offers a substantial context window of 32,768 tokens, making it suitable for applications requiring extensive text understanding, such as document summarization and conversational AI. With a competitive pricing structure of $4.00 per million tokens for both input and output, teams can effectively manage costs while scaling their usage based on project needs. This model is ideal for organizations looking to integrate advanced natural language processing capabilities into their workflows without incurring prohibitive expenses.
Context Window
32,768
Tokens
Input Price / 1M
$4.00
Prompt tokens
Output Price / 1M
$4.00
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $4.00 |
| Output (Completion) | $4.00 |
Price History
Current Input / 1M
$4.00
Current Output / 1M
$4.00
Estimate monthly spend for Meta: Llama 3.1 405B (base) based on your workload.
Estimated Monthly Cost
$148
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Meta: Llama 3.1 405B (base).
Meta: Llama 3.1 405B (base) input pricing is $4.00 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 405B (base) output pricing is $4.00 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 405B (base) supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.