Model Cost Profile

Z.ai: GLM 4.6

Developer: z-ai

Pricing updated Mar 11, 2026

Input rank: #185 · Output rank: #211

Live Pricing

Input: $0.3900

Output: $1.90

Pricing via OpenRouter API · Last synced Mar 11, 2026

Z.ai: GLM 4.6, developed by z-ai, offers a substantial context window of 204,800 tokens, making it well suited to applications that require extensive text comprehension, such as legal document analysis and long-form content generation. At the latest synced rates, teams using this API model pay $0.39 per million input tokens and $1.90 per million output tokens, figures that matter when budgeting projects with high token usage. This pricing structure lets organizations scale usage while keeping the cost of large-scale natural language processing tasks predictable.
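The per-million-token rates above translate directly into per-call and monthly spend. The sketch below uses the synced rates from this page; the workload numbers (4,000 input / 1,000 output tokens per call, 50,000 calls per month) are illustrative assumptions, not measured traffic.

```python
# Estimate per-request and monthly cost for Z.ai: GLM 4.6 at the
# synced OpenRouter rates on this page.
INPUT_PRICE_PER_M = 0.39    # USD per 1M input (prompt) tokens
OUTPUT_PRICE_PER_M = 1.90   # USD per 1M output (completion) tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for a single API call."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# Hypothetical workload: 4,000 input / 1,000 output tokens per call,
# 50,000 calls per month.
per_call = request_cost(4_000, 1_000)
monthly = per_call * 50_000
```

At these assumed volumes the output side dominates: 1,000 completion tokens cost more than 4,000 prompt tokens, so trimming verbose completions usually saves more than trimming prompts.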

🔧 Tool Calling · 📋 Structured Output · 🧠 Reasoning

Context Window

204,800

Tokens

Input Price / 1M

$0.3900

Prompt tokens

Output Price / 1M

$1.90

Completion tokens

Intelligence (MMLU)

Benchmark Pending

Massive Multitask Language Understanding

Price History

Z.ai: GLM 4.6 Pricing Trend

Input / 1M tokens: 0.0% change · Output / 1M tokens: 0.0% change

[Price trend chart, Mar 7 to Mar 11: input held at $0.3900 and output at $1.90 over the period.]

Current Input / 1M

$0.3900

Current Output / 1M

$1.90

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

FAQ

Common pricing and benchmark questions for Z.ai: GLM 4.6.

How much does Z.ai: GLM 4.6 cost per 1M input tokens?

Z.ai: GLM 4.6 input pricing is $0.3900 per 1M tokens based on the latest synced provider data.

How much does Z.ai: GLM 4.6 cost per 1M output tokens?

Z.ai: GLM 4.6 output pricing is $1.90 per 1M tokens based on the latest synced provider data.

What context window does Z.ai: GLM 4.6 support?

Z.ai: GLM 4.6 supports a context window of 204,800 tokens.
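Since input and output share the 204,800-token window, a practical pre-flight check is to confirm that prompt plus reserved completion budget fits. A minimal sketch, assuming token counts come from your own tokenizer (the check itself is generic, not part of any Z.ai API):

```python
# Check whether a request fits GLM 4.6's 204,800-token context window,
# reserving room for the completion.
CONTEXT_WINDOW = 204_800

def fits_in_context(prompt_tokens: int, max_output_tokens: int) -> bool:
    """True if prompt plus reserved completion budget fits the window."""
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW
```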

How can I compare Z.ai: GLM 4.6 with cheaper alternatives?

Use the comparison links on this page to open direct model-vs-model pricing and benchmark pages, then evaluate monthly spend projections for your workload.
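A spend projection like the one suggested above can be sketched in a few lines. The GLM 4.6 rates below are the synced values from this page; the alternative model's rates are hypothetical placeholders standing in for whatever candidate you pull from the comparison links.

```python
# Compare projected monthly spend for GLM 4.6 against a hypothetical
# cheaper alternative. Rates are USD per 1M tokens (input, output);
# "alt-model" is an illustrative placeholder, not a real quote.
MODELS = {
    "glm-4.6": (0.39, 1.90),   # synced rates from this page
    "alt-model": (0.20, 1.00), # hypothetical alternative
}

def monthly_spend(model: str, input_m: float, output_m: float) -> float:
    """USD per month, given millions of input/output tokens per month."""
    in_rate, out_rate = MODELS[model]
    return input_m * in_rate + output_m * out_rate

# Example workload: 200M input and 50M output tokens per month.
for name in MODELS:
    print(name, round(monthly_spend(name, 200, 50), 2))
```

Running both models through the same workload makes the trade-off concrete before committing to a production rollout.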