Model Cost Profile

IBM: Granite 4.0 Micro

Developer: ibm-granite· Tokenizer: Other · Quantization: unknown

Canonical ID: ibm-granite/granite-4.0-h-micro

Pricing updated Apr 24, 2026

Input rank: #32Output rank: #44

Live Pricing

Input: $0.0170

Output: $0.1100

HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026

IBM's Granite 4.0 Micro model, developed by ibm-granite, offers a substantial context window of 131,000 tokens, making it ideal for applications requiring extensive context, such as legal document analysis or large-scale content generation. With an input price of $0.02 per 1 million tokens and an output price of $0.11 per 1 million tokens, teams can effectively manage costs while leveraging the model for complex tasks like data summarization and conversational AI. This pricing structure allows organizations to scale their usage based on specific project needs, ensuring budget-friendly access to advanced AI capabilities.

Context Window

131,000

Input tokens

Full-context input ≈ $0.00

Max Output

—

Not specified

Input Price / 1M

$0.0170

Prompt tokens

Output Price / 1M

$0.1100

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

IBM: Granite 4.0 Micro Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0170

Current Output / 1M

$0.1100

Performance History

IBM: Granite 4.0 Micro Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0170
Output (Completion)	$0.1100

Compare with Meta: Llama 3.1 8B Instruct Compare with Mistral: Mistral Nemo Compare with Meta: Llama 3.2 1B Instruct

Cost Calculator

Estimate monthly spend for IBM: Granite 4.0 Micro based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$1.75

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$1.75 Free Models Router$0.00−$1.75 Google: Gemma 3 12B (free)$0.00−$1.75 Google: Gemma 3 27B (free)$0.00−$1.75

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

IBM: Granite 4.0 Micro vs Baidu: Qianfan-OCR-Fast (free)IBM: Granite 4.0 Micro vs Free Models Router IBM: Granite 4.0 Micro vs Google: Gemma 3 12B (free)IBM: Granite 4.0 Micro vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0170

Output (Completion)

$0.1100

Cost Calculator

Estimate monthly spend for IBM: Granite 4.0 Micro based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$1.75

25M input + 12M output tokens