Context Window
32,768
Input tokens
Full-context input ≈ $0.01
Model Cost Profile
Developer: thedrummer · Tokenizer: Qwen · Instruct: chatml · Quantization: bf16
Canonical ID: thedrummer/rocinante-12b
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.1700 / 1M tokens
Output: $0.4300 / 1M tokens
Last synced Apr 23, 2026
TheDrummer: Rocinante 12B has a 32,768-token context window, which suits longer inputs such as legal document analysis and long-form content generation. Input is priced at $0.17 per million tokens and output at $0.43 per million tokens, so a prompt that fills the entire context window costs about $0.01 in input. Because billing is per token, spend on workloads such as customer support automation and data analysis scales directly with the volume of prompt and completion tokens.
Max Output
32,768
Completion tokens
Input Price / 1M
$0.1700
Prompt tokens
Output Price / 1M
$0.4300
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.1700
Current Output / 1M
$0.4300
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.4%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1700 |
| Output (Completion) | $0.4300 |
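The per-token rates in the table translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the function name `request_cost` and the constants are assumptions introduced here, not part of any provider SDK, and the prices are copied from the table as listed on this page.

```python
from decimal import Decimal

# Prices per 1M tokens, copied from the table above (assumed current).
INPUT_PRICE_PER_M = Decimal("0.17")
OUTPUT_PRICE_PER_M = Decimal("0.43")
MILLION = Decimal(1_000_000)


def request_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the cost in USD of one request (hypothetical helper)."""
    total = (
        Decimal(prompt_tokens) * INPUT_PRICE_PER_M
        + Decimal(completion_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / MILLION


# A prompt that fills the whole 32,768-token context window, with no output.
full_context = request_cost(32_768, 0)
print(full_context)            # 0.00557056
print(round(full_context, 2))  # 0.01
```

Decimal arithmetic is used so that small per-request amounts are not distorted by floating-point rounding; the full-context figure rounds to the $0.01 shown on this page.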
Estimate monthly spend for TheDrummer: Rocinante 12B based on your workload.
Estimated Monthly Cost
$9.41
25M input + 12M output tokens
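The estimate above can be reproduced by hand: 25 × $0.17 + 12 × $0.43 = $4.25 + $5.16 = $9.41. A minimal sketch of the same calculation follows, assuming the listed prices hold for the whole month; `monthly_cost` and its parameters are hypothetical names chosen for illustration.

```python
from decimal import Decimal

# Listed prices per 1M tokens (assumed constant over the month).
INPUT_PRICE_PER_M = Decimal("0.17")
OUTPUT_PRICE_PER_M = Decimal("0.43")


def monthly_cost(input_millions: Decimal, output_millions: Decimal) -> Decimal:
    """Return estimated monthly spend in USD for token volumes given in millions."""
    return input_millions * INPUT_PRICE_PER_M + output_millions * OUTPUT_PRICE_PER_M


# Workload from the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(Decimal(25), Decimal(12))
print(f"${estimate:.2f}")  # $9.41
```

Changing the two volume arguments gives an estimate for a different workload under the same pricing assumption.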