Model Cost Profile

Google: Gemini 2.5 Flash Lite

Developer: Google · Tokenizer: Gemini · Quantization: unknown

Canonical ID: google/gemini-2.5-flash-lite

Pricing updated Apr 25, 2026

Input rank: #84 · Output rank: #100

Live Pricing

Input: $0.1000

Output: $0.4000


Last synced Apr 25, 2026 · MMLU score via public benchmark data

Google's Gemini 2.5 Flash Lite has a context window of 1,048,576 tokens, which suits workloads with long inputs such as multi-document analysis and long-running conversations. Input is priced at $0.10 per million tokens and output at $0.40 per million tokens, so each output token costs four times as much as an input token. At these rates the model is a candidate for high-volume data processing where per-token cost is the main constraint.

💡 Enable prompt caching to save 90% on repeated input tokens ($0.0100/M cached vs $0.1000/M standard).
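The 90% figure follows directly from the two listed rates. A minimal sketch of the arithmetic, assuming a hypothetical helper `input_cost` and ignoring the separate cache-write charge listed in the pricing table:

```python
# Rates taken from this page, in USD per 1M input tokens.
STANDARD_INPUT_PER_M = 0.10
CACHED_INPUT_PER_M = 0.01


def input_cost(total_tokens: int, cached_fraction: float) -> float:
    """Input cost in USD when `cached_fraction` of the tokens are cache reads.

    Hypothetical helper for illustration; cache-write fees are not modeled.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = total_tokens * cached_fraction
    uncached = total_tokens - cached
    return (cached * CACHED_INPUT_PER_M + uncached * STANDARD_INPUT_PER_M) / 1_000_000


uncached_cost = input_cost(1_000_000, 0.0)
fully_cached_cost = input_cost(1_000_000, 1.0)
savings = 1 - fully_cached_cost / uncached_cost
print(f"Savings on fully cached input: {savings:.0%}")
```

In practice only the repeated prefix of a prompt is likely to be cached, so the realized saving depends on the cached fraction of each request.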

👁 Vision · 🔧 Tool Calling · 🔌 MCP Compatible · 📋 Structured Output · 🧠 Reasoning

Provider Compliance

SOC 2 · HIPAA · FedRAMP · GDPR · ISO 27001

Context Window

1,048,576

Input tokens

Full-context input ≈ $0.10
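The rounded full-context figure can be reproduced from the listed limits and rates. This is a rough sketch using a hypothetical `token_cost` helper; it prices a full context window of input and a maximum-length completion separately and ignores caching and reasoning tokens:

```python
# Limits and rates taken from this page.
CONTEXT_WINDOW_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_535
INPUT_PRICE_PER_M = 0.10   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.40  # USD per 1M output tokens


def token_cost(tokens: int, price_per_million: float) -> float:
    """Cost in USD for `tokens` at a per-million-token rate (hypothetical helper)."""
    return tokens * price_per_million / 1_000_000


full_context_input = token_cost(CONTEXT_WINDOW_TOKENS, INPUT_PRICE_PER_M)
max_length_output = token_cost(MAX_OUTPUT_TOKENS, OUTPUT_PRICE_PER_M)

print(f"Full-context input: ${full_context_input:.4f}")
print(f"Max-length output:  ${max_length_output:.4f}")
```

The unrounded input cost is slightly above $0.10 because the window is 1,048,576 tokens rather than an even million.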

Max Output

65,535

Completion tokens

Input Price / 1M

$0.1000

Prompt tokens

Output Price / 1M

$0.4000

Completion tokens

Top Benchmark

75.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Google: Gemini 2.5 Flash Lite. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	62.5	#61 of 125	artificial_analysis
MMLU	75.9	#65 of 127	artificial_analysis

Price History

Google: Gemini 2.5 Flash Lite Pricing Trend

Input / 1M tokens: 0.0% · Output / 1M tokens: 0.0%

Current Input / 1M

$0.1000

Current Output / 1M

$0.4000

Performance History

Google: Gemini 2.5 Flash Lite Speed Trend

Tokens/sec (higher is better) · Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.9%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.1000
Output (Completion)	$0.4000
Cache Read	$0.0100
Cache Write	$0.083333
Internal Reasoning	$0.4000
Image Processing	$0.1000
Web Search	$14,000.00
Audio	$0.3000


Cost Calculator

Estimate monthly spend for Google: Gemini 2.5 Flash Lite based on your workload.

Input tokens / month

0 to 1.0B

Output tokens / month

0 to 500M

Estimated Monthly Cost

$7.30

25M input + 12M output tokens
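The estimate above is the input and output volumes multiplied by their per-million rates. A minimal sketch, assuming a hypothetical `monthly_cost` function and no caching, reasoning, image, or audio charges:

```python
# Rates taken from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.40


def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated monthly spend in USD, rounded to cents (hypothetical helper)."""
    cost = (
        input_tokens * INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return round(cost, 2)


# Workload shown in the calculator: 25M input + 12M output tokens.
estimate = monthly_cost(25_000_000, 12_000_000)
print(f"${estimate:.2f}")  # prints "$7.30"
```

Here the input side contributes $2.50 and the output side $4.80, so output dominates the bill despite the smaller token count.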

Same Workload on Other Models

Model	Monthly Cost	Difference
Baidu: Qianfan-OCR-Fast (free)	$0.00	−$7.30
Free Models Router	$0.00	−$7.30
Google: Gemma 3 12B (free)	$0.00	−$7.30
Google: Gemma 3 27B (free)	$0.00	−$7.30

