Context Window
131,072
Tokens
Model Cost Profile
Developer: google
Pricing updated Mar 11, 2026
Google's Gemma 3 4B model features an extensive context window of 131,072 tokens, making it ideal for applications requiring in-depth analysis such as legal document review or extensive content generation. With an input price of $0.04 per million tokens and an output price of $0.08 per million tokens, teams can effectively manage costs while leveraging the model for tasks like customer support automation or data summarization. This pricing structure allows organizations to scale their usage based on project needs, ensuring budget flexibility for various API-driven initiatives.
Context Window
131,072
Tokens
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.0800
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.0800 |
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.0800
Estimate monthly spend for Google: Gemma 3 4B based on your workload.
Estimated Monthly Cost
$1.96
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Google: Gemma 3 4B.
Google: Gemma 3 4B input pricing is $0.0400 per 1M tokens based on the latest synced provider data.
Google: Gemma 3 4B output pricing is $0.0800 per 1M tokens based on the latest synced provider data.
Google: Gemma 3 4B supports a context window of 131,072 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.