Context Window
1,048,576
Input tokens
Full-context input ≈ $0.31
Model Cost Profile
Developer: Google · Tokenizer: Gemini · Quantization: unknown
Canonical ID: google/gemini-2.5-flash
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.3000 / 1M tokens
Output: $2.50 / 1M tokens
Last synced Apr 24, 2026 · MMLU score via public benchmark data
Google's Gemini 2.5 Flash has a context window of 1,048,576 tokens, large enough to hold long conversations or multi-document inputs in a single request. Input costs $0.30 per million tokens and output costs $2.50 per million tokens, so filling the entire context window costs about $0.31 in input charges. Output is roughly 8.3 times more expensive per token than input, which means completion volume tends to dominate the bill in high-volume workloads.
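The per-request arithmetic behind these figures can be sketched as follows. This is a minimal illustration, not an official billing formula: the function name `request_cost` is hypothetical, the prices and limits are copied from this page, and the limit checks are a simplification that treats the input and output caps independently.

```python
from decimal import Decimal

# Figures copied from this page; verify against live pricing before relying on them.
INPUT_PRICE_PER_M = Decimal("0.30")   # USD per 1M prompt tokens
OUTPUT_PRICE_PER_M = Decimal("2.50")  # USD per 1M completion tokens
CONTEXT_WINDOW = 1_048_576            # maximum input tokens listed on this page
MAX_OUTPUT = 65_535                   # maximum completion tokens listed on this page


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if not 0 <= input_tokens <= CONTEXT_WINDOW:
        raise ValueError("input_tokens is outside the listed context window")
    if not 0 <= output_tokens <= MAX_OUTPUT:
        raise ValueError("output_tokens is outside the listed max output")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / Decimal(1_000_000)


full_context = request_cost(CONTEXT_WINDOW)
print(f"Full-context input: ${full_context:.4f}")  # $0.3146, shown on the page as ≈ $0.31
```

Using `Decimal` rather than floats keeps the sub-cent amounts exact, which matters when many small requests are summed.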
Max Output
65,535
Completion tokens
Input Price / 1M
$0.3000
Prompt tokens
Output Price / 1M
$2.50
Completion tokens
Top Benchmark
88.2
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Google: Gemini 2.5 Flash. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Current Input / 1M
$0.3000
Current Output / 1M
$2.50
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.4%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.3000 |
| Output (Completion) | $2.50 |
| Cache Read | $0.0300 |
| Cache Write | $0.083333 |
| Internal Reasoning | $2.50 |
| Image Processing | $0.3000 |
| Web Search | $14,000.00 ($0.014 each; likely billed per request, not per token) |
| Audio | $1.00 |
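
The token-based rows of the table can be combined into a rough workload estimate, as in the sketch below. It rests on several assumptions: the helper `workload_cost` is hypothetical, cached prompt tokens are assumed to be billed at the Cache Read rate instead of the Input rate, reasoning tokens are assumed to be billed in addition to visible output, and Cache Write, Image Processing, Web Search, and Audio are left out because the table does not make their billing units clear.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the table above.
PRICE_PER_M = {
    "input": Decimal("0.30"),
    "output": Decimal("2.50"),
    "cache_read": Decimal("0.03"),
    "reasoning": Decimal("2.50"),
}


def workload_cost(fresh_input: int, cached_input: int,
                  visible_output: int, reasoning: int = 0) -> Decimal:
    """Estimate USD cost from token counts (hypothetical helper)."""
    token_counts = {
        "input": fresh_input,        # prompt tokens not served from cache
        "cache_read": cached_input,  # prompt tokens served from cache
        "output": visible_output,    # completion tokens returned to the caller
        "reasoning": reasoning,      # internal reasoning tokens
    }
    total = sum(PRICE_PER_M[kind] * count for kind, count in token_counts.items())
    return total / Decimal(1_000_000)


uncached = workload_cost(2_000_000, 0, 0)
half_cached = workload_cost(1_000_000, 1_000_000, 0)
print(f"2M prompt tokens, no cache:  ${uncached:.2f}")     # $0.60
print(f"2M prompt tokens, 50% cache: ${half_cached:.2f}")  # $0.33
```

At these rates a cache read costs one tenth of a fresh input token, so workloads with a large shared prefix are the ones most likely to benefit from caching.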
Estimate monthly spend for Google: Gemini 2.5 Flash based on your workload.
Estimated Monthly Cost
$38
25M input + 12M output tokens
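
The $38 figure can be reproduced from the listed prices. The sketch below assumes the estimate covers only input and output tokens and that the page rounds half up to whole dollars; both are inferences from the numbers shown, not documented behavior.

```python
from decimal import Decimal, ROUND_HALF_UP

INPUT_PRICE_PER_M = Decimal("0.30")   # USD per 1M prompt tokens (from this page)
OUTPUT_PRICE_PER_M = Decimal("2.50")  # USD per 1M completion tokens (from this page)

input_millions = Decimal(25)   # 25M input tokens per month
output_millions = Decimal(12)  # 12M output tokens per month

input_cost = input_millions * INPUT_PRICE_PER_M     # 7.50
output_cost = output_millions * OUTPUT_PRICE_PER_M  # 30.00
total = input_cost + output_cost                    # 37.50

# Assumed rounding rule: half up to the nearest dollar.
rounded = total.quantize(Decimal("1"), rounding=ROUND_HALF_UP)
output_share = output_cost / total

print(f"Input ${input_cost}, output ${output_cost}, total ${total} (about ${rounded})")
print(f"Output share of spend: {output_share:.0%}")  # 80%
```

Although output accounts for under a third of the tokens in this workload, it makes up 80% of the spend, so trimming completion length is the more effective lever here.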