Context Window
131,072
Tokens
Model Cost Profile
Developer: meta-llama
Pricing updated Mar 11, 2026
Meta: Llama 3.1 70B Instruct is designed for complex instruction-following tasks, making it suitable for applications in customer support automation and content generation. With a context window of 131,072 tokens, this model can handle extensive dialogues and large documents, providing teams with the ability to maintain context over longer interactions. The pricing structure at $0.40 per million tokens for both input and output allows organizations to budget effectively while scaling their usage based on project demands.
Context Window
131,072
Tokens
Input Price / 1M
$0.4000
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.4000 |
| Output (Completion) | $0.4000 |
Price History
Current Input / 1M
$0.4000
Current Output / 1M
$0.4000
Estimate monthly spend for Meta: Llama 3.1 70B Instruct based on your workload.
Estimated Monthly Cost
$15
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Meta: Llama 3.1 70B Instruct.
Meta: Llama 3.1 70B Instruct input pricing is $0.4000 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 70B Instruct output pricing is $0.4000 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.1 70B Instruct supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.