Context Window
32,768
Input tokens
Full-context input ≈ $0.00
Model Cost Profile
Developer: google· Tokenizer: Other · Quantization: unknown
Canonical ID: google/gemma-3n-e4b-it
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.0600
Output: $0.1200
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Google's Gemma 3n 4B model features a context window of 32,768 tokens, making it suitable for applications requiring extensive text analysis, such as legal document review or long-form content generation. With an input price of $0.02 per 1 million tokens and an output price of $0.04 per 1 million tokens, teams can effectively manage costs while leveraging the model for data-intensive tasks. This pricing structure allows for scalable usage, enabling businesses to optimize their budget based on the volume of text processed and generated.
Provider Compliance
Context Window
32,768
Input tokens
Full-context input ≈ $0.00
Max Output
—
Not specified
Input Price / 1M
$0.0600
Prompt tokens
Output Price / 1M
$0.1200
Completion tokens
Top Benchmark
48.8
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Google: Gemma 3n 4B. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0600
Current Output / 1M
$0.1200
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0600 |
| Output (Completion) | $0.1200 |
Estimate monthly spend for Google: Gemma 3n 4B based on your workload.
Estimated Monthly Cost
$2.94
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.