Context Window
1,048,576
Input tokens
Full-context input ≈ $0.10
Model Cost Profile
Developer: google· Tokenizer: Gemini · Quantization: unknown
Canonical ID: google/gemini-2.5-flash-lite
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.1000
Output: $0.4000
Last synced Apr 25, 2026 · MMLU score via public benchmark data
Google's Gemini 2.5 Flash Lite offers an extensive context window of 1,048,576 tokens, making it suitable for applications requiring deep contextual understanding, such as complex document analysis and conversational AI. With an input pricing of $0.10 per million tokens and an output pricing of $0.40 per million tokens, teams can effectively manage costs while leveraging its capabilities for large-scale projects. This model is particularly advantageous for businesses needing high-volume data processing without sacrificing performance or accuracy.
Provider Compliance
Context Window
1,048,576
Input tokens
Full-context input ≈ $0.10
Max Output
65,535
Completion tokens
Input Price / 1M
$0.1000
Prompt tokens
Output Price / 1M
$0.4000
Completion tokens
Top Benchmark
75.9
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Google: Gemini 2.5 Flash Lite. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.1000
Current Output / 1M
$0.4000
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.9%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1000 |
| Output (Completion) | $0.4000 |
| Cache Read | $0.0100 |
| Cache Write | $0.083333 |
| Internal Reasoning | $0.4000 |
| Image Processing | $0.1000 |
| Web Search | $14,000.00 |
| Audio | $0.3000 |
Estimate monthly spend for Google: Gemini 2.5 Flash Lite based on your workload.
Estimated Monthly Cost
$7.30
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.