Context Window
1,048,576
Input tokens
Full-context input ≈ $1.57
Model Cost Profile
Developer: google · Tokenizer: Gemini · Quantization: unknown
Canonical ID: google/gemini-3.5-flash-20260519
Pricing updated May 20, 2026
Live Pricing
Input: $1.50
Output: $9.00
Last synced May 20, 2026 · MMLU score via public benchmark data
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...
Max Output
65,536
Completion tokens
Input Price / 1M
$1.50
Prompt tokens
Output Price / 1M
$9.00
Completion tokens
Top Benchmark
89.8
GPQA score (highest of MMLU, GPQA, MATH, HumanEval)
Evaluation scores for Google: Gemini 3.5 Flash. The "Top Benchmark" shown above is the highest score across MMLU, GPQA, MATH, and HumanEval.
Price History
Not enough data yet. Price tracking started recently; check back in a few days.
Performance History
Not enough data yet. Performance tracking started recently; check back in a few days.
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $1.50 |
| Output (Completion) | $9.00 |
| Cache Read | $0.1500 |
| Cache Write | $0.083333 |
| Internal Reasoning | $9.00 |
| Image Processing | $1.50 |
| Web Search | $14,000.00 |
| Audio | $3.00 |
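The rates in the table are quoted per one million tokens, so the cost of any token count is the count divided by 1,000,000 and multiplied by the rate. As a minimal sketch, the snippet below applies that to the full 1,048,576-token context window and arrives at the "Full-context input" figure of about $1.57. The names `PRICE_PER_MILLION` and `token_cost` are illustrative and not part of any provider API.

```python
# Per-million-token rates (USD) copied from the pricing table.
# The dictionary and function names are hypothetical, chosen for illustration.
PRICE_PER_MILLION = {
    "input": 1.50,
    "output": 9.00,
}

CONTEXT_WINDOW_TOKENS = 1_048_576  # context window size listed on the page


def token_cost(tokens: int, usage_type: str) -> float:
    """Return the USD cost of `tokens` at the listed rate for `usage_type`."""
    return tokens / 1_000_000 * PRICE_PER_MILLION[usage_type]


# Cost of filling the entire context window with prompt tokens.
full_context = token_cost(CONTEXT_WINDOW_TOKENS, "input")
print(f"${full_context:.2f}")  # $1.57
```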
Estimate monthly spend for Google: Gemini 3.5 Flash based on your workload.
Estimated Monthly Cost
$146
25M input + 12M output tokens
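The estimate can be reproduced by hand from the listed input and output rates. The sketch below assumes the workload consists only of plain prompt and completion tokens (no caching, web search, or audio), and `monthly_cost` is a hypothetical helper name. It yields $145.50, which matches the displayed $146 once rounded to whole dollars.

```python
# Listed rates in USD per one million tokens.
INPUT_PRICE_PER_MILLION = 1.50
OUTPUT_PRICE_PER_MILLION = 9.00


def monthly_cost(input_millions: float, output_millions: float) -> float:
    """Estimate monthly spend in USD from token volumes given in millions.

    Assumes only plain input and output tokens are billed.
    """
    return (input_millions * INPUT_PRICE_PER_MILLION
            + output_millions * OUTPUT_PRICE_PER_MILLION)


# 25M input tokens and 12M output tokens per month.
estimate = monthly_cost(25, 12)
print(f"${estimate:.2f}")  # $145.50
```

Output tokens dominate this workload: 12M completion tokens cost $108.00, against $37.50 for 25M prompt tokens.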