Context Window
131,000
Tokens
Model Cost Profile
Developer: ibm-granite
Pricing updated Mar 11, 2026
IBM's Granite 4.0 Micro, listed under the developer ibm-granite, has a context window of 131,000 tokens, which suits long-input work such as legal document analysis or summarizing large documents. Input is priced at $0.017 per 1 million tokens and output at $0.11 per 1 million tokens, so output tokens cost roughly 6.5 times as much as input tokens. Workloads that read a lot and write little, such as data summarization, therefore cost the least per request, while output-heavy tasks like conversational AI or content generation are driven mainly by the output price. Billing is per token, so spend scales directly with usage.
Input Price / 1M
$0.0170
Prompt tokens
Output Price / 1M
$0.1100
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0170 |
| Output (Completion) | $0.1100 |
Price History
Current Input / 1M
$0.0170
Current Output / 1M
$0.1100
Estimate monthly spend for IBM: Granite 4.0 Micro based on your workload.
Estimated Monthly Cost
$1.75
25M input + 12M output tokens
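The estimate above can be reproduced from the listed prices. The following is a minimal sketch, not the page's actual calculator: the function name `estimate_monthly_cost` is hypothetical, and rounding half-up to the nearest cent is an assumption chosen because it matches the displayed $1.75.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices in USD per 1M tokens, taken from the pricing table on this page.
INPUT_PRICE_PER_M = Decimal("0.0170")
OUTPUT_PRICE_PER_M = Decimal("0.1100")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the unrounded monthly spend in USD (hypothetical helper)."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Workload from the estimate: 25M input tokens + 12M output tokens.
raw = estimate_monthly_cost(25_000_000, 12_000_000)

# Assumption: the displayed figure is rounded half-up to whole cents.
rounded = raw.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)
print(f"${rounded}")
```

For this workload the input side contributes $0.425 and the output side $1.32, so about three quarters of the bill comes from output tokens even though there are fewer of them.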
Common pricing and benchmark questions for IBM: Granite 4.0 Micro.
IBM: Granite 4.0 Micro input pricing is $0.0170 per 1M tokens based on the latest synced provider data.
IBM: Granite 4.0 Micro output pricing is $0.1100 per 1M tokens based on the latest synced provider data.
IBM: Granite 4.0 Micro supports a context window of 131,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.