Context Window
131,072
Input tokens
Full-context input ≈ $0.0052
Model Cost Profile
Developer: google · Tokenizer: Gemini · Instruct: gemma · Quantization: bf16
Canonical ID: google/gemma-3-4b-it
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.0400 / 1M tokens
Output: $0.0800 / 1M tokens
Last synced Apr 23, 2026
Google's Gemma 3 4B has a context window of 131,072 tokens, enough to pass long inputs such as contracts or multi-document briefs in a single request. Input costs $0.04 per million tokens and output costs $0.08 per million tokens, so filling the entire context window costs roughly half a cent in input charges. At these rates, spend on high-volume workloads such as customer support automation or data summarization grows linearly with token usage, which makes it straightforward to budget per project.
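As a rough illustration of how the per-million rates translate into a per-request charge, the sketch below computes the cost of a single call. The helper name `request_cost` is an assumption made for this example; only the two rates and the context size come from this page.

```python
from decimal import Decimal

# Rates listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.04")
OUTPUT_PRICE_PER_M = Decimal("0.08")
CONTEXT_WINDOW = 131_072  # tokens

def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / Decimal(1_000_000)

# Input cost of a prompt that fills the whole context window.
full_context = request_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context:.4f}")
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many calls.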
Provider Compliance
Max Output
—
Not specified
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.0800
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.0800
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.0800 |
Estimate monthly spend for Google: Gemma 3 4B based on your workload.
Estimated Monthly Cost
$1.96
25M input + 12M output tokens
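The estimate above can be reproduced from the two listed rates. The following is a minimal sketch, assuming a flat per-token price with no volume discounts, caching, or minimum charges; the function name `monthly_cost` is hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Rates listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.04")
OUTPUT_PRICE_PER_M = Decimal("0.08")

def monthly_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate monthly spend in USD, rounded to cents (hypothetical helper)."""
    million = Decimal(1_000_000)
    cost = (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / million
    return cost.quantize(Decimal("0.01"))

# Workload from the estimator: 25M input + 12M output tokens per month.
print(monthly_cost(25_000_000, 12_000_000))
```

Here the input side contributes $1.00 (25 × $0.04) and the output side $0.96 (12 × $0.08), so output tokens account for nearly half the bill despite being less than a third of the volume.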