Context Window
32,768
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: google· Tokenizer: Gemini · Instruct: gemma · Quantization: unknown
Canonical ID: google/gemma-3-4b-it
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.0000
Output: $0.0000
Last synced Apr 22, 2026
Google's Gemma 3 4B model offers a substantial context window of 32,768 tokens, making it suitable for applications that require extensive text analysis or generation, such as summarizing long documents or engaging in complex conversations. As a free API model, it allows teams to leverage advanced natural language processing capabilities without incurring input or output costs, making it an attractive option for startups and research projects. This cost-effective solution enables organizations to experiment and scale their AI initiatives without financial barriers, facilitating innovation in various sectors.
Provider Compliance
Context Window
32,768
Input tokens
Full-context input ≈ $0.00
Max Output
8,192
Completion tokens
Input Price / 1M
$0.0000
Prompt tokens
Output Price / 1M
$0.0000
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.000000
Current Output / 1M
$0.000000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0000 |
| Output (Completion) | $0.0000 |
Estimate monthly spend for Google: Gemma 3 4B (free) based on your workload.
Estimated Monthly Cost
$0.00
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.