Context Window
8,192
Input tokens
Full-context input ≈ $0.01
Model Cost Profile
Developer: sao10k · Tokenizer: Llama3 · Instruct: llama3 · Quantization: bf16
Canonical ID: sao10k/l3-euryale-70b
Pricing updated Apr 22, 2026
Live Pricing
Input: $1.48
Output: $1.48
Last synced Apr 22, 2026
Sao10k's Llama 3 Euryale 70B v2.1 has a context window of 8,192 tokens, which suits conversational AI and the summarization or analysis of documents that fit within that window. Input and output are both priced at $1.48 per million tokens, so cost scales linearly with total token volume regardless of how a workload splits between prompt and completion tokens. A request that fills the entire context window costs about $0.01 in input tokens.
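The per-request arithmetic behind the "Full-context input ≈ $0.01" figure can be sketched as follows. This is a minimal illustration that uses only the prices and context size listed on this page; `request_cost` is a hypothetical helper, not part of any provider API.

```python
from decimal import Decimal

# Values taken from this page (USD per 1M tokens, context size in tokens).
INPUT_PRICE_PER_M = Decimal("1.48")
OUTPUT_PRICE_PER_M = Decimal("1.48")
CONTEXT_WINDOW = 8192


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request (hypothetical helper)."""
    million = Decimal(1_000_000)
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / million


# Cost of a prompt that fills the whole context window, with no completion.
full_context = request_cost(CONTEXT_WINDOW)
print(f"${full_context:.4f}")  # $0.0121
```

Because input and output share the same rate, only the total token count matters here; the two prices are kept as separate constants so the sketch still works if they diverge.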
Max Output
8,192
Completion tokens
Input Price / 1M
$1.48
Prompt tokens
Output Price / 1M
$1.48
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$1.48
Current Output / 1M
$1.48
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $1.48 |
| Output (Completion) | $1.48 |
Estimate monthly spend for Sao10k: Llama 3 Euryale 70B v2.1 based on your workload.
Estimated Monthly Cost
$55
25M input + 12M output tokens
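The $55 estimate appears to follow directly from the listed prices: 37M total tokens at $1.48 per million comes to roughly $54.76. A minimal sketch of that calculation, assuming flat per-token pricing with no volume discounts or minimum fees (the page does not mention any); `monthly_cost` is a hypothetical helper name:

```python
def monthly_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = 1.48,   # USD per 1M prompt tokens (from this page)
    output_price_per_m: float = 1.48,  # USD per 1M completion tokens (from this page)
) -> float:
    """Return the estimated monthly spend in USD (hypothetical helper)."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Workload shown in the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(25_000_000, 12_000_000)
print(f"${estimate:.2f}")
```

Substituting your own monthly token counts gives a comparable figure for other workloads.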