Context Window
131,072
Input tokens
Full-context input ≈ $0.11
Model Cost Profile
Developer: sao10k· Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: sao10k/l3.1-euryale-70b
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.8500
Output: $0.8500
Last synced Apr 23, 2026
Sao10K's Llama 3.1 Euryale 70B v2.2 model features an extensive context window of 32,768 tokens, making it ideal for applications requiring in-depth understanding, such as document summarization and complex conversational agents. With an input price of $0.65 per million tokens and an output price of $0.75 per million tokens, teams can effectively budget for high-volume projects while leveraging the model's capabilities for nuanced content generation. This model is particularly beneficial for enterprises that need to process large datasets or maintain continuity in long-form interactions.
Context Window
131,072
Input tokens
Full-context input ≈ $0.11
Max Output
16,384
Completion tokens
Input Price / 1M
$0.8500
Prompt tokens
Output Price / 1M
$0.8500
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.8500
Current Output / 1M
$0.8500
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.8500 |
| Output (Completion) | $0.8500 |
Estimate monthly spend for Sao10K: Llama 3.1 Euryale 70B v2.2 based on your workload.
Estimated Monthly Cost
$31
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.