Model Cost Profile
Developer: google · Tokenizer: Gemma · Quantization: fp4
Canonical ID: google/gemma-4-31b-it-20260402
Pricing updated May 20, 2026
Live Pricing
Input: $0.1200
Output: $0.3700
Last synced May 20, 2026
Google's Gemma 4 31B model offers a context window of 262,144 tokens, which suits applications that process large volumes of text, such as legal document analysis or long-form content generation. At $0.12 per million input tokens and $0.37 per million output tokens, filling the entire context window costs about $0.03 in input, so teams can budget predictably for tasks like customer support automation or data extraction. Completions are capped at 16,384 tokens per request. This pricing makes the model a low-cost option for businesses that want to integrate AI into their workflows.
Provider Compliance
| Metric | Value | Notes |
|---|---|---|
| Context Window | 262,144 | Input tokens; full-context input ≈ $0.03 |
| Max Output | 16,384 | Completion tokens |
| Input Price / 1M | $0.1200 | Prompt tokens |
| Output Price / 1M | $0.3700 | Completion tokens |
| Top Benchmark | Pending | No benchmark data yet |
Price History
Current Input / 1M: $0.1200
Current Output / 1M: $0.3700
Performance History
Current TPS: 0.00
Current Latency: 0 ms
Uptime: 99.6%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1200 |
| Output (Completion) | $0.3700 |
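As a rough illustration of how the per-million prices in the table translate into a per-request figure, the sketch below computes the cost of a single call. The helper name `request_cost` is hypothetical and not part of any provider API; the prices and the 262,144-token context size are the ones listed on this page.

```python
from decimal import Decimal

# Prices per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE_PER_M = Decimal("0.12")
OUTPUT_PRICE_PER_M = Decimal("0.37")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request (hypothetical helper)."""
    return (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M


# Input cost of filling the whole 262,144-token context window.
full_context = request_cost(262_144, 0)
print(f"${full_context:.4f}")  # $0.0315, shown on the page as ≈ $0.03
```

`Decimal` is used here instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed across many calls.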
Estimate monthly spend for Google: Gemma 4 31B based on your workload.
Estimated Monthly Cost: $7.44 (25M input + 12M output tokens)
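The monthly estimate above can be reproduced by hand: 25 × $0.12 + 12 × $0.37 = $3.00 + $4.44 = $7.44. A minimal sketch of the same arithmetic follows, assuming token volumes are given in millions; the function name `monthly_cost` is illustrative and does not correspond to any published calculator API.

```python
from decimal import Decimal

# Prices per 1M tokens as listed on this page.
PRICES_PER_M = {"input": Decimal("0.12"), "output": Decimal("0.37")}


def monthly_cost(input_millions, output_millions) -> Decimal:
    """Return estimated monthly USD spend for token volumes given in millions."""
    return (
        Decimal(str(input_millions)) * PRICES_PER_M["input"]
        + Decimal(str(output_millions)) * PRICES_PER_M["output"]
    )


estimate = monthly_cost(25, 12)
print(f"${estimate:.2f}")  # $7.44
```

Changing the two arguments gives a quick estimate for other workloads, for example `monthly_cost(100, 20)` for a more input-heavy pipeline.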