Context Window
1,048,576
Input tokens
Full-context input โ $0.26
Model Cost Profile
Developer: googleยท Tokenizer: Gemini ยท Quantization: unknown
Canonical ID: google/gemini-3.1-flash-lite-preview-20260303
Pricing updated Apr 22, 2026
Live Pricing
Input: $0.2500
Output: $1.50
Last synced Apr 22, 2026
Google's Gemini 3.1 Flash Lite Preview offers a substantial context window of 1,048,576 tokens, making it suitable for complex applications such as conversational agents and large-scale document processing. With an input price of $0.25 per million tokens and an output price of $1.50 per million tokens, teams can effectively manage costs while leveraging its advanced capabilities for data-intensive tasks. This model is particularly beneficial for organizations that require high throughput and extensive context handling in their AI-driven solutions.
Provider Compliance
Context Window
1,048,576
Input tokens
Full-context input โ $0.26
Max Output
65,536
Completion tokens
Input Price / 1M
$0.2500
Prompt tokens
Output Price / 1M
$1.50
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.2500
Current Output / 1M
$1.50
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.3%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.2500 |
| Output (Completion) | $1.50 |
| Cache Read | $0.0250 |
| Cache Write | $0.083333 |
| Internal Reasoning | $1.50 |
| Image Processing | $0.2500 |
| Web Search | $14,000.00 |
| Audio | $0.5000 |
Estimate monthly spend for Google: Gemini 3.1 Flash Lite Preview based on your workload.
Estimated Monthly Cost
$24
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.