Context Window
1,048,576
Input tokens
Full-context input ≈ $0.08
Model Cost Profile
Developer: google· Tokenizer: Gemini · Quantization: unknown
Canonical ID: google/gemini-2.0-flash-lite-001
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.0750
Output: $0.3000
Last synced Apr 25, 2026 · MMLU score via public benchmark data
Google's Gemini 2.0 Flash Lite offers a substantial context window of 1,048,576 tokens, making it suitable for applications requiring extensive data processing, such as natural language understanding and large-scale content generation. Teams leveraging this API model can expect an input cost of $0.07 per million tokens and an output cost of $0.30 per million tokens, allowing for budget-friendly scalability in projects involving complex queries and responses. Its design caters to businesses that need efficient and high-capacity processing for real-time analytics and interactive AI solutions.
Provider Compliance
Context Window
1,048,576
Input tokens
Full-context input ≈ $0.08
Max Output
8,192
Completion tokens
Input Price / 1M
$0.0750
Prompt tokens
Output Price / 1M
$0.3000
Completion tokens
Top Benchmark
72.4
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Google: Gemini 2.0 Flash Lite. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.0750
Current Output / 1M
$0.3000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.0750 |
| Output (Completion) | $0.3000 |
| Internal Reasoning | $0.3000 |
| Image Processing | $0.0750 |
| Web Search | $14,000.00 |
| Audio | $0.0750 |
Estimate monthly spend for Google: Gemini 2.0 Flash Lite based on your workload.
Estimated Monthly Cost
$5.48
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.