Context Window
202,752
Input tokens
Full-context input โ $0.12
Model Cost Profile
Developer: z-aiยท Tokenizer: Other ยท Quantization: fp4
Canonical ID: z-ai/glm-5-20260211
Pricing updated Apr 25, 2026
Live Pricing
Input: $0.6000
Output: $2.08
Last synced Apr 25, 2026
Z.ai: GLM 5, developed by z-ai, offers an extensive context window of 204800 tokens, making it suitable for applications requiring deep contextual understanding, such as long-form content generation and complex data analysis. With an input price of $0.95 per 1 million tokens and an output price of $2.55 per 1 million tokens, teams can effectively manage costs while leveraging the model for tasks like customer support automation and advanced natural language processing. This pricing structure allows organizations to scale their usage based on specific project needs, optimizing budget allocation for AI-driven solutions.
Context Window
202,752
Input tokens
Full-context input โ $0.12
Max Output
16,384
Completion tokens
Input Price / 1M
$0.6000
Prompt tokens
Output Price / 1M
$2.08
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.6000
Current Output / 1M
$2.08
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
99.7%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.6000 |
| Output (Completion) | $2.08 |
| Cache Read | $0.1200 |
Estimate monthly spend for Z.ai: GLM 5 based on your workload.
Estimated Monthly Cost
$40
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.