Context Window
8,192
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: sao10k· Tokenizer: Llama3 · Instruct: llama3 · Quantization: fp8
Canonical ID: sao10k/l3-lunaris-8b
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0400
Output: $0.0500
Last synced Apr 24, 2026
Sao10K's Llama 3 8B Lunaris model features an extensive context window of 8192 tokens, making it suitable for complex applications such as conversational AI, document summarization, and content generation. With an input price of $0.04 per 1 million tokens and an output price of $0.05 per million tokens, teams can effectively manage costs while leveraging the model for high-volume tasks. This pricing structure allows for scalable deployment in various industries, including customer support, marketing, and data analysis.
Context Window
8,192
Input tokens
Full-context input ≈ $0.00
Max Output
—
Not specified
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.0500
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.0500
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.0500 |
Estimate monthly spend for Sao10K: Llama 3 8B Lunaris based on your workload.
Estimated Monthly Cost
$1.60
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.