Context Window
16,000
Input tokens
Full-context input ≈ $0.05
Model Cost Profile
Developer: sao10k· Tokenizer: Llama3 · Quantization: bf16
Canonical ID: sao10k/l3.1-70b-hanami-x1
Pricing updated Apr 23, 2026
Live Pricing
Input: $3.00
Output: $3.00
Last synced Apr 23, 2026
Sao10K: Llama 3.1 70B Hanami x1 offers a substantial context window of 16,000 tokens, making it suitable for applications requiring in-depth analysis, such as document summarization and complex dialogue systems. With an input and output pricing of $3.00 per 1 million tokens, teams can effectively manage costs while scaling their usage for tasks like content generation and real-time data processing. This model is ideal for organizations that need a balance of performance and affordability in their AI-driven solutions.
Context Window
16,000
Input tokens
Full-context input ≈ $0.05
Max Output
—
Not specified
Input Price / 1M
$3.00
Prompt tokens
Output Price / 1M
$3.00
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$3.00
Current Output / 1M
$3.00
Performance History
Not enough data yet. Performance tracking started recently — check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $3.00 |
| Output (Completion) | $3.00 |
Estimate monthly spend for Sao10K: Llama 3.1 70B Hanami x1 based on your workload.
Estimated Monthly Cost
$111
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.