Model Cost Profile

Google: Gemma 3 4B

Developer: google· Tokenizer: Gemini · Instruct: gemma · Quantization: bf16

Canonical ID: google/gemma-3-4b-it

Pricing updated Apr 23, 2026

Input rank: #41Output rank: #35

Live Pricing

Input: $0.0400

Output: $0.0800

Visit Google ↗HuggingFace ↗View full pricing leaderboard

Last synced Apr 23, 2026

Google's Gemma 3 4B model features an extensive context window of 131,072 tokens, making it ideal for applications requiring in-depth analysis such as legal document review or extensive content generation. With an input price of $0.04 per million tokens and an output price of $0.08 per million tokens, teams can effectively manage costs while leveraging the model for tasks like customer support automation or data summarization. This pricing structure allows organizations to scale their usage based on project needs, ensuring budget flexibility for various API-driven initiatives.

👁 Vision📋 Structured Output

Provider Compliance

SOC 2HIPAAFedRAMPGDPRISO 27001

Context Window

131,072

Input tokens

Full-context input ≈ $0.01

Max Output

—

Not specified

Input Price / 1M

$0.0400

Prompt tokens

Output Price / 1M

$0.0800

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Google: Gemma 3 4B Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.0400

Current Output / 1M

$0.0800

Performance History

Google: Gemma 3 4B Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

100.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.0400
Output (Completion)	$0.0800

Compare with Google: Gemma 3 12B Compare with NVIDIA: Nemotron Nano 9B V2 Compare with Qwen: Qwen2.5 7B Instruct

Cost Calculator

Estimate monthly spend for Google: Gemma 3 4B based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$1.96

25M input + 12M output tokens

Same Workload on Other Models

Arcee AI: Trinity Large Preview (free)$0.00−$1.96 Free Models Router$0.00−$1.96 Google: Gemma 3 12B (free)$0.00−$1.96 Google: Gemma 3 27B (free)$0.00−$1.96

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Google: Gemma 3 4B vs Arcee AI: Trinity Large Preview (free)Google: Gemma 3 4B vs Free Models Router Google: Gemma 3 4B vs Google: Gemma 3 12B (free)Google: Gemma 3 4B vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.0400

Output (Completion)

$0.0800