Model Cost Profile

Z.ai: GLM 5

Developer: z-ai

Pricing updated Mar 11, 2026

Input rank: #223Output rank: #232

Live Pricing

Input: $0.7200

Output: $2.30

Pricing via OpenRouter API ยท Last synced Mar 11, 2026

Z.ai: GLM 5, developed by z-ai, offers an extensive context window of 204800 tokens, making it suitable for applications requiring deep contextual understanding, such as long-form content generation and complex data analysis. With an input price of $0.95 per 1 million tokens and an output price of $2.55 per 1 million tokens, teams can effectively manage costs while leveraging the model for tasks like customer support automation and advanced natural language processing. This pricing structure allows organizations to scale their usage based on specific project needs, optimizing budget allocation for AI-driven solutions.

๐Ÿ”ง Tool Calling๐Ÿ“‹ Structured Output๐Ÿง  Reasoning

Context Window

202,752

Tokens

Input Price / 1M

$0.7200

Prompt tokens

Output Price / 1M

$2.30

Completion tokens

Intelligence (MMLU)

Benchmark Pending

Massive Multitask Language Understanding

Price History

Z.ai: GLM 5 Pricing Trend

Input / 1M tokens-10.0%Output / 1M tokens-10.2%
Mar 7 โ€” Mar 11
$0.7200$1.64$2.56Mar 7Mar 8Mar 9Mar 10Mar 11

Current Input / 1M

$0.7200

Current Output / 1M

$2.30

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for Z.ai: GLM 5.

How much does Z.ai: GLM 5 cost per 1M input tokens?

Z.ai: GLM 5 input pricing is $0.7200 per 1M tokens based on the latest synced provider data.

How much does Z.ai: GLM 5 cost per 1M output tokens?

Z.ai: GLM 5 output pricing is $2.30 per 1M tokens based on the latest synced provider data.

What context window does Z.ai: GLM 5 support?

Z.ai: GLM 5 supports a context window of 202,752 tokens.

How can I compare Z.ai: GLM 5 with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.