Context Window
204,800
Input tokens
Full-context input โ $0.08
Model Cost Profile
Developer: z-aiยท Tokenizer: Other ยท Quantization: fp8
Canonical ID: z-ai/glm-4.6
Pricing updated Apr 24, 2026
Live Pricing
Input: $0.3900
Output: $1.90
Last synced Apr 24, 2026
Z.ai: GLM 4.6, developed by z-ai, offers a substantial context window of 202,752 tokens, making it ideal for applications requiring extensive text comprehension, such as legal document analysis and long-form content generation. Teams leveraging this API model can expect an input cost of $0.35 per million tokens and an output cost of $1.71 per million tokens, which can influence budgeting for projects with high token usage. This pricing structure allows organizations to scale their usage efficiently while managing costs associated with large-scale natural language processing tasks.
Context Window
204,800
Input tokens
Full-context input โ $0.08
Max Output
204,800
Completion tokens
Input Price / 1M
$0.3900
Prompt tokens
Output Price / 1M
$1.90
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$0.3900
Current Output / 1M
$1.90
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
98.8%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.3900 |
| Output (Completion) | $1.90 |
Estimate monthly spend for Z.ai: GLM 4.6 based on your workload.
Estimated Monthly Cost
$33
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.