Model Cost Profile

Google: Gemini 3.1 Flash Lite Preview

Developer: google· Tokenizer: Gemini · Quantization: unknown

Canonical ID: google/gemini-3.1-flash-lite-preview-20260303

Pricing updated Apr 22, 2026

Input rank: #150Output rank: #195

Live Pricing

Input: $0.2500

Output: $1.50

Visit Google ↗View full pricing leaderboard

Last synced Apr 22, 2026

Google's Gemini 3.1 Flash Lite Preview offers a substantial context window of 1,048,576 tokens, making it suitable for complex applications such as conversational agents and large-scale document processing. With an input price of $0.25 per million tokens and an output price of $1.50 per million tokens, teams can effectively manage costs while leveraging its advanced capabilities for data-intensive tasks. This model is particularly beneficial for organizations that require high throughput and extensive context handling in their AI-driven solutions.

💡 Enable prompt caching to save 90% on repeated input tokens ($0.0250/M cached vs $0.2500/M standard).

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning💾 Implicit Caching

Provider Compliance

SOC 2HIPAAFedRAMPGDPRISO 27001

Context Window

1,048,576

Input tokens

Full-context input ≈ $0.26

Max Output

65,536

Completion tokens

Input Price / 1M

$0.2500

Prompt tokens

Output Price / 1M

$1.50

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Google: Gemini 3.1 Flash Lite Preview Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.2500

Current Output / 1M

$1.50

Performance History

Google: Gemini 3.1 Flash Lite Preview Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.3%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.2500
Output (Completion)	$1.50
Cache Read	$0.0250
Cache Write	$0.083333
Internal Reasoning	$1.50
Image Processing	$0.2500
Web Search	$14,000.00
Audio	$0.5000

Compare with Google: Gemini 2.5 Flash Compare with Anthropic: Claude 3 Haiku Compare with ByteDance Seed: Seed 1.6

Cost Calculator

Estimate monthly spend for Google: Gemini 3.1 Flash Lite Preview based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$24

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$24 Free Models Router$0.00−$24 Google: Gemma 3 12B (free)$0.00−$24 Google: Gemma 3 27B (free)$0.00−$24

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Google: Gemini 3.1 Flash Lite Preview vs Arcee AI: Trinity Large Preview (free)Google: Gemini 3.1 Flash Lite Preview vs Free Models Router Google: Gemini 3.1 Flash Lite Preview vs Google: Gemma 3 12B (free)Google: Gemini 3.1 Flash Lite Preview vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.2500

Output (Completion)

$1.50

Cache Read

$0.0250

Cache Write

$0.083333

Internal Reasoning

$1.50

Image Processing

$0.2500

Web Search

$14,000.00

Audio

$0.5000

Cost Calculator

Estimate monthly spend for Google: Gemini 3.1 Flash Lite Preview based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$24

25M input + 12M output tokens