Context Window
262,144
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: Google · Tokenizer: Gemma · Quantization: unknown
Canonical ID: google/gemma-4-31b-it-20260402
Pricing updated May 23, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced May 23, 2026
Google's Gemma 4 31B model has a context window of 262,144 tokens, which suits applications that involve long-document analysis or long-form content generation. Input and output tokens are both currently priced at $0.00, so teams can integrate it without per-token charges. The model is particularly useful for research institutions and developers working on large-scale data processing or conversational AI systems.
Provider Compliance
Max Output
32,768
Completion tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.4%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Estimate monthly spend for Google: Gemma 4 31B (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
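The estimate above is simple per-token arithmetic: tokens divided by one million, multiplied by the listed price per 1M tokens, summed over input and output. The following is a minimal sketch of that calculation, not the page's actual calculator. The function names `token_cost` and `monthly_cost` are hypothetical, and the prices are the ones listed on this page.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def token_cost(tokens: int, price_per_million: Decimal) -> Decimal:
    """Return the USD cost of `tokens` at a price quoted per 1M tokens."""
    return Decimal(tokens) / TOKENS_PER_MILLION * price_per_million


def monthly_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: Decimal,
    output_price: Decimal,
) -> Decimal:
    """Sum input and output costs for one month's token volume."""
    return token_cost(input_tokens, input_price) + token_cost(
        output_tokens, output_price
    )


# Prices per 1M tokens as listed on this page (both are zero).
INPUT_PRICE = Decimal("0.0000")
OUTPUT_PRICE = Decimal("0.0000")

# Workload from the estimate above: 25M input + 12M output tokens.
estimate = monthly_cost(25_000_000, 12_000_000, INPUT_PRICE, OUTPUT_PRICE)

# Cost of filling the whole 262,144-token context window with input.
full_context = token_cost(262_144, INPUT_PRICE)

print(f"Estimated monthly cost: ${estimate:.2f}")
print(f"Full-context input: ${full_context:.2f}")
```

Because both listed prices are zero, every workload evaluates to $0.00. The same functions apply unchanged if the listed prices change; `Decimal` is used here only to avoid floating-point rounding in currency values.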
Quick links for cost-reduction decisions before production rollout.