Model Cost Profile
Developer: anthracite-org · Tokenizer: Qwen · Instruct: chatml · Quantization: fp8
Canonical ID: anthracite-org/magnum-v4-72b
Pricing updated Apr 21, 2026
Live Pricing
Input: $3.00
Output: $5.00
Last synced Apr 21, 2026
The Magnum v4 72B model from anthracite-org has a 16,384-token context window and a maximum output of 2,048 tokens per completion. Input costs $3.00 per million tokens and output costs $5.00 per million tokens, so a prompt that fills the entire context window costs about $0.05 in input. These limits suit workloads such as customer support automation and summarization of moderately long documents. Material longer than the context window, such as large legal filings or datasets, has to be split into chunks, and long-form text has to be generated in segments of at most 2,048 tokens.
| Metric | Value | Notes |
|---|---|---|
| Context Window | 16,384 | Input tokens; full-context input ≈ $0.05 |
| Max Output | 2,048 | Completion tokens |
| Input Price / 1M | $3.00 | Prompt tokens |
| Output Price / 1M | $5.00 | Completion tokens |
| Top Benchmark | Pending | No benchmark data yet |
Price History
Current Input / 1M: $3.00
Current Output / 1M: $5.00
Performance History
Current TPS: 0.00
Current Latency: 0ms
Uptime: 100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $3.00 |
| Output (Completion) | $5.00 |
Estimate monthly spend for Magnum v4 72B based on your workload.
Estimated Monthly Cost
$135
25M input + 12M output tokens
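The arithmetic behind this estimate can be reproduced with a few lines of Python. The sketch below is illustrative only: the function name `estimate_cost` and the constant names are hypothetical, and the prices are simply copied from the pricing table on this page, so they would need updating if the listed prices change.

```python
from decimal import Decimal

# Prices copied from this page (USD per 1M tokens); assumed current.
INPUT_PRICE_PER_M = Decimal("3.00")
OUTPUT_PRICE_PER_M = Decimal("5.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token counts."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Monthly workload from the estimate above: 25M input + 12M output tokens.
monthly = estimate_cost(25_000_000, 12_000_000)
print(f"Estimated monthly cost: ${monthly:.2f}")

# Input cost of a single prompt that fills the 16,384-token context window.
full_context = estimate_cost(16_384, 0)
print(f"Full-context input cost: ${full_context:.4f}")
```

`Decimal` is used instead of floats so that the dollar amounts do not pick up binary rounding noise. Substituting your own monthly token counts gives a comparable estimate for a different workload.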