Model Cost Profile

Meta: Llama 3.3 70B Instruct (free)

Developer: meta-llama · Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8

Canonical ID: meta-llama/llama-3.3-70b-instruct

Pricing updated Apr 24, 2026

Input rank: #17 · Output rank: #17

Live Pricing

Input: $0.0000 / 1M tokens

Output: $0.0000 / 1M tokens

Last synced Apr 24, 2026

Meta: Llama 3.3 70B Instruct is an instruction-tuned, 70-billion-parameter language model for natural language understanding, text generation, and conversational agents, which suits it to applications such as customer service and content creation. The model supports a context window of up to 128,000 tokens, although this listing reports 65,536 input tokens, so teams can use it for tasks that require reading long documents or keeping context across extended interactions. As a free API model, it carries no input or output token charges, which leaves more budget for projects that process high token volumes.

🔧 Tool Calling · 🔌 MCP Compatible

Context Window: 65,536 input tokens (full-context input ≈ $0.00)

Max Output: not specified

Input Price / 1M: $0.0000 (prompt tokens)

Output Price / 1M: $0.0000 (completion tokens)

Top Benchmark: pending (no benchmark data yet)

Price History

Meta: Llama 3.3 70B Instruct (free) Pricing Trend

[Chart: input and output price per 1M tokens over time]

Current Input / 1M: $0.000000

Current Output / 1M: $0.000000

Performance History

Meta: Llama 3.3 70B Instruct (free) Speed Trend

[Chart: tokens/sec (higher is better) and latency (lower is better) over time]

Current TPS: 0.00

Current Latency: 0 ms

Uptime: 94.7%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0000
Output (Completion)	$0.0000

Compare with: Meta: Llama 3.2 3B Instruct (free) · Baidu: Qianfan-OCR-Fast (free) · Free Models Router

Cost Calculator

Estimate monthly spend for Meta: Llama 3.3 70B Instruct (free) based on your workload.

Input tokens / month: 0 to 1.0B (slider range)

Output tokens / month: 0 to 500M (slider range)

Estimated Monthly Cost: $0.00 (25M input + 12M output tokens)
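
The estimate above is presumably the usual per-million-token calculation: each token count divided by 1,000,000 and multiplied by the matching price. The sketch below is one way to reproduce it. It assumes that formula; the function name `monthly_cost` and the non-zero rates in the second call are hypothetical and are not taken from this page.

```python
def monthly_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimate monthly spend in dollars from token counts and per-1M-token prices."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# Workload from the calculator: 25M input + 12M output tokens at the listed $0.0000 rates.
free_cost = monthly_cost(25_000_000, 12_000_000, 0.0, 0.0)
print(f"${free_cost:.2f}")  # $0.00

# Same workload at hypothetical paid rates ($0.10 input, $0.30 output per 1M tokens),
# shown only to illustrate the formula; these are not prices from this page.
paid_cost = monthly_cost(25_000_000, 12_000_000, 0.10, 0.30)
print(f"${paid_cost:.2f}")  # $6.10
```

With both prices at zero, the result stays $0.00 for any workload, which matches the full-context input figure reported for the 65,536-token window.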

Same Workload on Other Models

Model	Estimated Monthly Cost
Baidu: Qianfan-OCR-Fast (free)	$0.00
Free Models Router	$0.00
Google: Gemma 3 12B (free)	$0.00
Google: Gemma 3 27B (free)	$0.00
