Model Cost Profile

Google: Gemma 4 31B

Developer: google· Tokenizer: Gemma · Quantization: fp4

Canonical ID: google/gemma-4-31b-it-20260402

Pricing updated May 20, 2026

Input rank: #105Output rank: #95

Live Pricing

Input: $0.1200

Output: $0.3700

Visit Google ↗HuggingFace ↗View full pricing leaderboard

Last synced May 20, 2026

Google's Gemma 4 31B model offers an extensive context window of 262,144 tokens, making it suitable for applications that require processing large volumes of text, such as legal document analysis or long-form content generation. With an input cost of $0.13 per million tokens and an output cost of $0.38 per million tokens, teams can effectively manage their budgets while leveraging advanced AI capabilities for tasks like customer support automation or data extraction. This model's scalability and pricing structure make it an attractive option for businesses looking to integrate AI into their workflows without incurring prohibitive costs.

👁 Vision🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Provider Compliance

SOC 2HIPAAFedRAMPGDPRISO 27001

Context Window

262,144

Input tokens

Full-context input ≈ $0.03

Max Output

16,384

Completion tokens

Input Price / 1M

$0.1200

Prompt tokens

Output Price / 1M

$0.3700

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History