Context Window
131,000
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: ibm-granite· Tokenizer: Other · Quantization: unknown
Canonical ID: ibm-granite/granite-4.0-h-micro
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0170
Output: $0.1100
Last synced Apr 24, 2026
IBM's Granite 4.0 Micro model, developed by ibm-granite, offers a substantial context window of 131,000 tokens, making it ideal for applications requiring extensive context, such as legal document analysis or large-scale content generation. With an input price of $0.02 per 1 million tokens and an output price of $0.11 per 1 million tokens, teams can effectively manage costs while leveraging the model for complex tasks like data summarization and conversational AI. This pricing structure allows organizations to scale their usage based on specific project needs, ensuring budget-friendly access to advanced AI capabilities.
Context Window
131,000
Input tokens
Full-context input ≈ $0.00
Max Output
—
Not specified
Input Price / 1M
$0.0170
Prompt tokens
Output Price / 1M
$0.1100
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0170
Current Output / 1M
$0.1100
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0170 |
| Output (Completion) | $0.1100 |
Estimate monthly spend for IBM: Granite 4.0 Micro based on your workload.
Estimated Monthly Cost
$1.75
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.