Context Window
8,192
Tokens
Model Cost Profile
Developer: google
Pricing updated Mar 11, 2026
Google's Gemma 2 9B model offers a substantial context window of 8192 tokens, making it suitable for applications requiring extensive text analysis, such as document summarization and conversational AI. With an input cost of $0.03 per 1M tokens and an output cost of $0.09 per 1M tokens, teams can effectively manage their budgets while scaling their usage based on project needs. This model is particularly beneficial for organizations looking to enhance their natural language processing capabilities without incurring excessive operational expenses.
Context Window
8,192
Tokens
Input Price / 1M
$0.0300
Prompt tokens
Output Price / 1M
$0.0900
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0300 |
| Output (Completion) | $0.0900 |
Price History
Current Input / 1M
$0.0300
Current Output / 1M
$0.0900
Estimate monthly spend for Google: Gemma 2 9B based on your workload.
Estimated Monthly Cost
$1.83
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Google: Gemma 2 9B.
Google: Gemma 2 9B input pricing is $0.0300 per 1M tokens based on the latest synced provider data.
Google: Gemma 2 9B output pricing is $0.0900 per 1M tokens based on the latest synced provider data.
Google: Gemma 2 9B supports a context window of 8,192 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.