Context Window
131,072
Input tokens
Full-context input โ $0.01
Model Cost Profile
Developer: ibm-graniteยท Tokenizer: Other ยท Quantization: bf16
Canonical ID: ibm-granite/granite-4.1-8b-20260429
Pricing updated May 1, 2026
Live Pricing
Input: $0.0500
Output: $0.1000
Last synced May 1, 2026
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...
Context Window
131,072
Input tokens
Full-context input โ $0.01
Max Output
131,072
Completion tokens
Input Price / 1M
$0.0500
Prompt tokens
Output Price / 1M
$0.1000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Not enough data yet. Price tracking started recently โ check back in a few days.
Performance History
Not enough data yet. Performance tracking started recently โ check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0500 |
| Output (Completion) | $0.1000 |
| Cache Read | $0.0500 |
Estimate monthly spend for IBM: Granite 4.1 8B based on your workload.
Estimated Monthly Cost
$2.45
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.