Context Window
128,000
Tokens
Model Cost Profile
Developer: z-ai
Pricing updated Mar 11, 2026
Z.ai: GLM 4 32B, developed by z-ai, offers an extensive context window of 128,000 tokens, making it ideal for complex applications such as long-form content generation and in-depth data analysis. With a competitive pricing structure of $0.10 per million tokens for both input and output, teams can efficiently manage costs while leveraging the model's capabilities for large-scale projects. This model is particularly beneficial for enterprises requiring high-volume processing, as it allows for seamless integration into workflows without significant financial overhead.
Context Window
128,000
Tokens
Input Price / 1M
$0.1000
Prompt tokens
Output Price / 1M
$0.1000
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.1000 |
| Output (Completion) | $0.1000 |
Price History
Current Input / 1M
$0.1000
Current Output / 1M
$0.1000
Estimate monthly spend for Z.ai: GLM 4 32B based on your workload.
Estimated Monthly Cost
$3.70
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Z.ai: GLM 4 32B .
Z.ai: GLM 4 32B input pricing is $0.1000 per 1M tokens based on the latest synced provider data.
Z.ai: GLM 4 32B output pricing is $0.1000 per 1M tokens based on the latest synced provider data.
Z.ai: GLM 4 32B supports a context window of 128,000 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.