Context Window
262,144
Input tokens
Full-context input โ $0.04
Model Cost Profile
Developer: googleยท Tokenizer: Gemini
Pricing updated Apr 3, 2026
Live Pricing
Input: $0.1400
Output: $0.4000
Last synced Apr 3, 2026
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function calling, and multilingual support across 140+ languages. Strong on coding, reasoning, and document understanding tasks. Apache 2.0 license.
Provider Compliance
Context Window
262,144
Input tokens
Full-context input โ $0.04
Max Output
131,072
Completion tokens
Input Price / 1M
$0.1400
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Not enough data yet. Price tracking started recently โ check back in a few days.
Performance History
Not enough data yet. Performance tracking started recently โ check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1400 |
| Output (Completion) | $0.4000 |
Estimate monthly spend for Google: Gemma 4 31B based on your workload.
Estimated Monthly Cost
$8.30
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.