Model Cost Profile
Developer: sao10k · Tokenizer: Llama3 · Instruct: llama3 · Quantization: bf16
Canonical ID: sao10k/l3.3-euryale-70b-v2.3
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.6500 / 1M tokens
Output: $0.7500 / 1M tokens
Last synced Apr 23, 2026
Sao10K: Llama 3.3 Euryale 70B has a context window of 131,072 tokens, which suits long-context tasks such as document summarization and multi-turn conversational AI. The API is priced at $0.65 per million input tokens and $0.75 per million output tokens, so a prompt that fills the entire context window costs about $0.09 in input. Because billing is per token, cost scales linearly with usage, and teams can budget directly from their expected token volume.
Context Window
131,072
Input tokens
Full-context input ≈ $0.09
Max Output
16,384
Completion tokens
Input Price / 1M
$0.6500
Prompt tokens
Output Price / 1M
$0.7500
Completion tokens
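The per-request arithmetic behind these figures can be sketched as follows. This is a minimal illustration, not an official client: the function name `request_cost` is hypothetical, the limits and prices are copied from the listing above, and treating the context window and max output as two independent caps is an assumption (a provider may count completion tokens against the context window).

```python
# Figures copied from the listing above; names are illustrative only.
CONTEXT_WINDOW = 131_072     # maximum input (prompt) tokens
MAX_OUTPUT = 16_384          # maximum completion tokens
INPUT_PRICE_PER_M = 0.65     # USD per 1M prompt tokens
OUTPUT_PRICE_PER_M = 0.75    # USD per 1M completion tokens


def request_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the USD cost of one request, rejecting sizes beyond the listed limits."""
    # Assumption: the two limits are checked independently of each other.
    if not 0 <= input_tokens <= CONTEXT_WINDOW:
        raise ValueError(f"input_tokens must be between 0 and {CONTEXT_WINDOW}")
    if not 0 <= output_tokens <= MAX_OUTPUT:
        raise ValueError(f"output_tokens must be between 0 and {MAX_OUTPUT}")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Cost of a prompt that fills the whole context window (about $0.09 in cents).
full_context = request_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context:.4f}")
```

A request that also uses the full 16,384-token output allowance adds roughly another cent, so the most expensive single request under these limits stays just under $0.10.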
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.6500
Current Output / 1M
$0.7500
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
96.2%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.6500 |
| Output (Completion) | $0.7500 |
Estimate monthly spend for Sao10K: Llama 3.3 Euryale 70B based on your workload.
Estimated Monthly Cost
$25
25M input + 12M output tokens
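The estimate above follows directly from the per-million prices. A minimal sketch of the calculation, assuming flat per-token billing with no volume discounts or minimum fees (the function name `monthly_cost` is hypothetical):

```python
# Prices copied from the table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.65
OUTPUT_PRICE_PER_M = 0.75


def monthly_cost(input_millions: float, output_millions: float) -> float:
    """Return the estimated monthly spend in USD for volumes given in millions of tokens."""
    if input_millions < 0 or output_millions < 0:
        raise ValueError("token volumes must be non-negative")
    return input_millions * INPUT_PRICE_PER_M + output_millions * OUTPUT_PRICE_PER_M


# The workload shown above: 25M input + 12M output tokens per month.
estimate = monthly_cost(25, 12)
print(f"Estimated monthly cost: ${estimate:.2f}")
```

For this workload the input side contributes $16.25 and the output side $9.00, for $25.25 in total, which the page rounds to $25.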