Context Window
131,072
Input tokens
Full-context input โ $0.08
Model Cost Profile
Developer: z-aiยท Tokenizer: Other ยท Quantization: fp8
Canonical ID: z-ai/glm-4.5
Pricing updated Apr 21, 2026
Live Pricing
Input: $0.6000
Output: $2.20
Last synced Apr 21, 2026
Z.ai: GLM 4.5 offers an extensive context window of 131,000 tokens, making it ideal for applications requiring deep contextual understanding, such as legal document analysis and long-form content generation. With an input price of $0.55 per 1 million tokens and an output price of $2.00 per 1 million tokens, teams can effectively manage costs based on their specific usage patterns. This pricing structure allows organizations to scale their projects efficiently while leveraging the model's capabilities for complex tasks across various industries.
Context Window
131,072
Input tokens
Full-context input โ $0.08
Max Output
98,304
Completion tokens
Input Price / 1M
$0.6000
Prompt tokens
Output Price / 1M
$2.20
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.6000
Current Output / 1M
$2.20
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.6000 |
| Output (Completion) | $2.20 |
| Cache Read | $0.1100 |
Estimate monthly spend for Z.ai: GLM 4.5 based on your workload.
Estimated Monthly Cost
$41
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.