Context Window
32,768
Tokens
Model Cost Profile
Developer: google
Pricing updated Mar 11, 2026
Google's Gemma 3 4B model offers a substantial context window of 32,768 tokens, making it suitable for applications that require extensive text analysis or generation, such as summarizing long documents or engaging in complex conversations. As a free API model, it allows teams to leverage advanced natural language processing capabilities without incurring input or output costs, making it an attractive option for startups and research projects. This cost-effective solution enables organizations to experiment and scale their AI initiatives without financial barriers, facilitating innovation in various sectors.
Context Window
32,768
Tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Estimate monthly spend for Google: Gemma 3 4B (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Google: Gemma 3 4B (free).
Google: Gemma 3 4B (free) input pricing is $0.0000 per 1M tokens based on the latest synced provider data.
Google: Gemma 3 4B (free) output pricing is $0.0000 per 1M tokens based on the latest synced provider data.
Google: Gemma 3 4B (free) supports a context window of 32,768 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.