Context Window
204,800
Tokens
Model Cost Profile
Developer: z-ai
Pricing updated Mar 11, 2026
Z.ai: GLM 4.6, developed by z-ai, offers a substantial context window of 202,752 tokens, making it ideal for applications requiring extensive text comprehension, such as legal document analysis and long-form content generation. Teams leveraging this API model can expect an input cost of $0.35 per million tokens and an output cost of $1.71 per million tokens, which can influence budgeting for projects with high token usage. This pricing structure allows organizations to scale their usage efficiently while managing costs associated with large-scale natural language processing tasks.
Context Window
204,800
Tokens
Input Price / 1M
$0.3900
Prompt tokens
Output Price / 1M
$1.90
Completion tokens
Intelligence (MMLU)
Benchmark Pending
Massive Multitask Language Understanding
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $0.3900 |
| Output (Completion) | $1.90 |
Price History
Current Input / 1M
$0.3900
Current Output / 1M
$1.90
Estimate monthly spend for Z.ai: GLM 4.6 based on your workload.
Estimated Monthly Cost
$33
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.
Common pricing and benchmark questions for Z.ai: GLM 4.6.
Z.ai: GLM 4.6 input pricing is $0.3900 per 1M tokens based on the latest synced provider data.
Z.ai: GLM 4.6 output pricing is $1.90 per 1M tokens based on the latest synced provider data.
Z.ai: GLM 4.6 supports a context window of 204,800 tokens.
Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.