Model Cost Profile

Google: Gemma 3 4B (free)

Developer: google· Tokenizer: Gemini · Instruct: gemma · Quantization: unknown

Canonical ID: google/gemma-3-4b-it

Pricing updated Apr 22, 2026

Input rank: #5Output rank: #5

Live Pricing

Input: $0.0000

Output: $0.0000

Visit Google ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 22, 2026

Google's Gemma 3 4B model offers a substantial context window of 32,768 tokens, making it suitable for applications that require extensive text analysis or generation, such as summarizing long documents or engaging in complex conversations. As a free API model, it allows teams to leverage advanced natural language processing capabilities without incurring input or output costs, making it an attractive option for startups and research projects. This cost-effective solution enables organizations to experiment and scale their AI initiatives without financial barriers, facilitating innovation in various sectors.

👁 Vision📋 Structured Output

Provider Compliance

SOC 2HIPAAFedRAMPGDPRISO 27001

Context Window

32,768

Input tokens

Full-context input ≈ $0.00

Max Output

8,192

Completion tokens

Input Price / 1M

$0.0000

Prompt tokens

Output Price / 1M

$0.0000

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Google: Gemma 3 4B (free) Pricing Trend

Input / 1M tokensOutput / 1M tokens

Current Input / 1M

$0.000000

Current Output / 1M

$0.000000

Performance History

Google: Gemma 3 4B (free) Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0000
Output (Completion)	$0.0000

Compare with Google: Gemma 3 12B (free)Compare with Arcee AI: Trinity Large Preview (free)Compare with Free Models Router

Cost Calculator

Estimate monthly spend for Google: Gemma 3 4B (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00 Free Models Router$0.00 Google: Gemma 3 12B (free)$0.00 Google: Gemma 3 27B (free)$0.00

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Google: Gemma 3 4B (free) vs Arcee AI: Trinity Large Preview (free)Google: Gemma 3 4B (free) vs Free Models Router Google: Gemma 3 4B (free) vs Google: Gemma 3 12B (free)Google: Gemma 3 4B (free) vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0000

Output (Completion)

$0.0000

Cost Calculator

Estimate monthly spend for Google: Gemma 3 4B (free) based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$0.00

25M input + 12M output tokens