Model Cost Profile

Z.ai: GLM 5

Developer: z-ai· Tokenizer: Other · Quantization: fp4

Canonical ID: z-ai/glm-5-20260211

Pricing updated Apr 25, 2026

Input rank: #223Output rank: #232

Live Pricing

Input: $0.6000

Output: $2.08

HuggingFace ↗View full pricing leaderboard

Last synced Apr 25, 2026

Z.ai: GLM 5, developed by z-ai, offers an extensive context window of 204800 tokens, making it suitable for applications requiring deep contextual understanding, such as long-form content generation and complex data analysis. With an input price of $0.95 per 1 million tokens and an output price of $2.55 per 1 million tokens, teams can effectively manage costs while leveraging the model for tasks like customer support automation and advanced natural language processing. This pricing structure allows organizations to scale their usage based on specific project needs, optimizing budget allocation for AI-driven solutions.

💡 Enable prompt caching to save 80% on repeated input tokens ($0.1200/M cached vs $0.6000/M standard).

🔧 Tool Calling🔌 MCP Compatible📋 Structured Output🧠 Reasoning

Context Window

202,752

Input tokens

Full-context input ≈ $0.12

Max Output

16,384

Completion tokens

Input Price / 1M

$0.6000

Prompt tokens

Output Price / 1M

$2.08

Completion tokens

Top Benchmark

Pending

No benchmark data yet

Price History

Z.ai: GLM 5 Pricing Trend

Input / 1M tokens-25.0%Output / 1M tokens-18.8%

Current Input / 1M

$0.6000

Current Output / 1M

$2.08

Performance History

Z.ai: GLM 5 Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

99.7%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.6000
Output (Completion)	$2.08
Cache Read	$0.1200

Compare with Z.ai: GLM 4.5 Compare with MoonshotAI: Kimi K2 Thinking Compare with OpenAI: GPT Audio Mini

Cost Calculator

Estimate monthly spend for Z.ai: GLM 5 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$40

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$40 Free Models Router$0.00−$40 Google: Gemma 3 12B (free)$0.00−$40 Google: Gemma 3 27B (free)$0.00−$40

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Z.ai: GLM 5 vs Baidu: Qianfan-OCR-Fast (free)Z.ai: GLM 5 vs Free Models Router Z.ai: GLM 5 vs Google: Gemma 3 12B (free)Z.ai: GLM 5 vs Google: Gemma 3 27B (free)

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.6000

Output (Completion)

$2.08

Cache Read

$0.1200

Cost Calculator

Estimate monthly spend for Z.ai: GLM 5 based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$40

25M input + 12M output tokens