Context Window
131,072
Input tokens
Full-context input ≈ $0.0052 (131,072 tokens at $0.04 / 1M)
Model Cost Profile
Developer: google · Tokenizer: Gemini · Instruct: gemma · Quantization: bf16
Canonical ID: google/gemma-3-12b-it
Pricing updated Apr 23, 2026
Live Pricing
Input: $0.0400
Output: $0.1300
Last synced Apr 23, 2026
Google's Gemma 3 12B instruction-tuned model (google/gemma-3-12b-it) has a context window of 131,072 tokens, which suits long-input workloads such as legal document analysis and long-form content generation. Input is priced at $0.04 per million tokens and output at $0.13 per million tokens, so a prompt that fills the entire window costs roughly half a cent in input charges. Because billing is per token, spend scales linearly with volume, and teams can size usage to each project's budget.
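The per-request arithmetic behind these figures can be sketched in a few lines of Python. This is a minimal illustration, not a provider API: the function name `request_cost` and the constants are hypothetical, the prices are the ones listed on this page, and the check assumes the context window limits input tokens only, as the listing labels it.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M = Decimal("0.04")
OUTPUT_PRICE_PER_M = Decimal("0.13")
CONTEXT_WINDOW = 131_072  # listed context window, in tokens
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request at the listed per-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: the window limits input tokens only, as the listing labels it.
    if input_tokens > CONTEXT_WINDOW:
        raise ValueError("input exceeds the 131,072-token context window")
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / TOKENS_PER_M


# Cost of a prompt that fills the whole window, with no output tokens.
full_context = request_cost(CONTEXT_WINDOW)
print(full_context)
```

A full-window prompt works out to $0.00524288, which is the "Full-context input" figure once rounded to the nearest cent. `Decimal` is used instead of floats so that small per-request amounts add up without binary rounding drift.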
Provider Compliance
Max Output
—
Not specified
Input Price / 1M
$0.0400
Prompt tokens
Output Price / 1M
$0.1300
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.0400
Current Output / 1M
$0.1300
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.9%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0400 |
| Output (Completion) | $0.1300 |
Estimate monthly spend for Google: Gemma 3 12B based on your workload.
Estimated Monthly Cost
$2.56
25M input + 12M output tokens
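The estimate above follows directly from the listed prices. The sketch below reproduces it in Python; `monthly_cost` and `PRICES_PER_M` are hypothetical names chosen for illustration, and volumes are given in millions of tokens to match the "Price / 1M Tokens" unit.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from this page).
PRICES_PER_M = {"input": Decimal("0.04"), "output": Decimal("0.13")}


def monthly_cost(input_millions, output_millions) -> dict:
    """Return a USD cost breakdown for a monthly volume given in millions of tokens."""
    # Convert via str() so float arguments such as 2.5 keep their decimal value.
    input_cost = Decimal(str(input_millions)) * PRICES_PER_M["input"]
    output_cost = Decimal(str(output_millions)) * PRICES_PER_M["output"]
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


# The workload shown above: 25M input tokens and 12M output tokens per month.
estimate = monthly_cost(25, 12)
print(estimate["input"], estimate["output"], estimate["total"])
```

For this workload the input share is $1.00 and the output share is $1.56, giving the $2.56 total. Although output is under a third of the token volume, it accounts for most of the cost, so shortening completions is the more effective lever for reducing spend.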