Context Window
128,000
Tokens
Model Cost Profile
Developer: meta-llama
Pricing updated Mar 11, 2026
Meta: Llama 3.3 70B Instruct is a powerful AI model designed for tasks such as natural language understanding, text generation, and conversational agents, making it suitable for a variety of applications in customer service and content creation. With an extensive context window of 128,000 tokens, teams can utilize this model for complex tasks that require understanding long documents or maintaining context over extended interactions. As a free API model, it offers significant cost savings for teams, eliminating input and output charges, which can enhance budget flexibility for projects requiring high-volume data processing.
Context Window
128,000
Tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Estimate monthly spend for Meta: Llama 3.3 70B Instruct (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Meta: Llama 3.3 70B Instruct (free).
Meta: Llama 3.3 70B Instruct (free) input pricing is $0.0000 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.3 70B Instruct (free) output pricing is $0.0000 per 1M tokens based on the latest synced provider data.
Meta: Llama 3.3 70B Instruct (free) supports a context window of 128,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.