Context Window
1,048,576
Input tokens
Full-context input ≈ $0.26
Model Cost Profile
Developer: google· Tokenizer: Gemini · Quantization: unknown
Canonical ID: google/gemini-3.1-flash-lite-20260507
Pricing updated May 8, 2026
Live Pricing
Input: $0.2500
Output: $1.50
Last synced May 8, 2026 · MMLU score via public benchmark data
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Provider Compliance
Context Window
1,048,576
Input tokens
Full-context input ≈ $0.26
Max Output
65,536
Completion tokens
Input Price / 1M
$0.2500
Prompt tokens
Output Price / 1M
$1.50
Completion tokens
Top Benchmark
75.9
MMLU score — highest of MMLU, GPQA, MATH, HumanEval
Evaluation scores for Google: Gemini 3.1 Flash Lite. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.
Price History
Not enough data yet. Price tracking started recently — check back in a few days.
Performance History
Not enough data yet. Performance tracking started recently — check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2500 |
| Output (Completion) | $1.50 |
| Cache Read | $0.0250 |
| Cache Write | $0.083333 |
| Internal Reasoning | $1.50 |
| Image Processing | $0.2500 |
| Web Search | $14,000.00 |
| Audio | $0.5000 |
Estimate monthly spend for Google: Gemini 3.1 Flash Lite based on your workload.
Estimated Monthly Cost
$24
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.