Model Cost Profile

Google: Gemini 2.5 Flash

Developer: google· Tokenizer: Gemini · Quantization: unknown

Canonical ID: google/gemini-2.5-flash

Pricing updated Apr 24, 2026

Input rank: #171Output rank: #240

Live Pricing

Input: $0.3000

Output: $2.50

Visit Google ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Google's Gemini 2.5 Flash model offers a substantial context window of 1,048,576 tokens, making it suitable for complex applications such as conversational AI, document summarization, and large-scale data analysis. With an input price of $0.30 per million tokens and an output price of $2.50 per million tokens, teams can effectively manage their budget while leveraging the model's capabilities for extensive text generation and processing tasks. This pricing structure allows organizations to scale their usage according to project demands, ensuring cost efficiency in high-volume scenarios.

💡 Enable prompt caching to save 90% on repeated input tokens ($0.0300/M cached vs $0.3000/M standard).

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Provider Compliance

SOC 2HIPAAFedRAMPGDPRISO 27001

Context Window

1,048,576

Input tokens

Full-context input ≈ $0.31

Max Output

65,535

Completion tokens

Input Price / 1M

$0.3000

Prompt tokens

Output Price / 1M

$2.50

Completion tokens

Top Benchmark

88.2

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Google: Gemini 2.5 Flash. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	81.2	#18 of 125	artificial_analysis
MMLU	88.2	#1 of 127	artificial_analysis

Price History

Google: Gemini 2.5 Flash Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.3000

Current Output / 1M

$2.50

Performance History

Google: Gemini 2.5 Flash Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.4%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.3000
Output (Completion)	$2.50
Cache Read	$0.0300
Cache Write	$0.083333
Internal Reasoning	$2.50
Image Processing	$0.3000
Web Search	$14,000.00
Audio	$1.00

Compare with Google: Nano Banana (Gemini 2.5 Flash Image)Compare with Amazon: Nova 2 Lite Compare with Kwaipilot: KAT-Coder-Pro V2

Cost Calculator

Estimate monthly spend for Google: Gemini 2.5 Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$38 Free Models Router$0.00−$38 Google: Gemma 3 12B (free)$0.00−$38 Google: Gemma 3 27B (free)$0.00−$38

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Google: Gemini 2.5 Flash vs Baidu: Qianfan-OCR-Fast (free)Google: Gemini 2.5 Flash vs Free Models Router Google: Gemini 2.5 Flash vs Google: Gemma 3 12B (free)Google: Gemini 2.5 Flash vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

81.2

#18 of 125

artificial_analysis

MMLU

88.2

#1 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.3000

Output (Completion)

$2.50

Cache Read

$0.0300

Cache Write

$0.083333

Internal Reasoning

$2.50

Image Processing

$0.3000

Web Search

$14,000.00

Audio

$1.00

Cost Calculator

Estimate monthly spend for Google: Gemini 2.5 Flash based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$38

25M input + 12M output tokens