Context Window
65,536
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: meta-llama/llama-3.3-70b-instruct
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced Apr 24, 2026
Meta: Llama 3.3 70B Instruct is an instruction-tuned, 70-billion-parameter model for natural language understanding, text generation, and conversational agents, which suits it to applications such as customer service and content creation. This listing exposes a 65,536-token context window (the model itself supports up to 128,000 tokens), enough for long documents or extended multi-turn interactions. As a free API model, it carries no input or output charges, so high-volume workloads add no per-token cost.
Max Output
—
Not specified
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
94.7%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Estimate monthly spend for Meta: Llama 3.3 70B Instruct (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
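The estimate above follows from simple per-million-token arithmetic. A minimal sketch of that calculation is shown below, assuming the estimator multiplies each token volume by its listed price per 1M tokens and sums the two; the function name `estimate_cost` and its parameters are illustrative and are not part of any published API.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of a token volume at per-1M-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    (Hypothetical helper; assumes cost scales linearly with tokens.)
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost


# Workload from the listing: 25M input + 12M output tokens at the listed prices.
monthly = estimate_cost(25_000_000, 12_000_000, "0.0000", "0.0000")

# One request that fills the listed 65,536-token context window with input only.
full_context = estimate_cost(65_536, 0, "0.0000", "0.0000")

print(f"Estimated monthly cost: ${monthly:.2f}")
print(f"Full-context input: ${full_context:.2f}")
```

With both listed prices at $0.0000, every workload evaluates to $0.00. The same function can be used to compare against a paid variant by substituting that variant's per-1M prices.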