Model Cost Profile

Z.ai: GLM 4.6V

Developer: z-ai· Tokenizer: Other · Quantization: fp8

Canonical ID: z-ai/glm-4.6-20251208

Pricing updated Apr 24, 2026

Input rank: #182Output rank: #165

Live Pricing

Input: $0.3000

Output: $0.9000

HuggingFace ↗View full pricing leaderboard

Last synced Apr 24, 2026 · MMLU score via public benchmark data

Z.ai: GLM 4.6V offers an extensive context window of 131072 tokens, making it suitable for applications requiring deep contextual understanding, such as legal document analysis and long-form content generation. Teams leveraging this API model can expect input costs of $0.30 per million tokens and output costs of $0.90 per million tokens, allowing for flexible budgeting based on usage patterns. This pricing structure is particularly advantageous for businesses that handle large volumes of text, enabling them to optimize costs while maximizing the model's capabilities.

👁 Vision🔧 Tool Calling🔌 MCP Compatible🧠 Reasoning

Context Window

131,072

Input tokens

Full-context input ≈ $0.04

Max Output

131,072

Completion tokens

Input Price / 1M

$0.3000

Prompt tokens

Output Price / 1M

$0.9000

Completion tokens

Top Benchmark

79.9

MMLU score — highest of MMLU, GPQA, MATH, HumanEval

Quality & Benchmarks

Evaluation scores for Z.ai: GLM 4.6V. The “Top Benchmark” shown above is the highest score across MMLU, GPQA, MATH & HumanEval.

Benchmark	Score	Rank	Source
GPQA	71.9	#48 of 125	artificial_analysis
MMLU	79.9	#48 of 127	artificial_analysis

Price History

Z.ai: GLM 4.6V Pricing Trend

Input / 1M tokens0.0%Output / 1M tokens0.0%

Current Input / 1M

$0.3000

Current Output / 1M

$0.9000

Performance History

Z.ai: GLM 4.6V Speed Trend

Tokens/sec (higher is better)Latency (lower is better)

Current TPS

0.00

Current Latency

0ms

Uptime

95.0%

Side-by-Side Pricing Table

Usage Type	Price / 1M Tokens
Input (Prompt)	$0.3000
Output (Completion)	$0.9000

Compare with Z.ai: GLM 4.7 Compare with Amazon: Nova 2 Lite Compare with Google: Gemini 2.5 Flash

Cost Calculator

Estimate monthly spend for Z.ai: GLM 4.6V based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$18

25M input + 12M output tokens

Same Workload on Other Models

Baidu: Qianfan-OCR-Fast (free)$0.00−$18 Free Models Router$0.00−$18 Google: Gemma 3 12B (free)$0.00−$18 Google: Gemma 3 27B (free)$0.00−$18

Cheaper Alternatives to Compare

Quick links for cost-down decisions before production rollout.

Z.ai: GLM 4.6V vs Baidu: Qianfan-OCR-Fast (free)Z.ai: GLM 4.6V vs Free Models Router Z.ai: GLM 4.6V vs Google: Gemma 3 12B (free)Z.ai: GLM 4.6V vs Google: Gemma 3 27B (free)

Benchmark

Score

Rank

Source

GPQA

71.9

#48 of 125

artificial_analysis

MMLU

79.9

#48 of 127

artificial_analysis

Usage Type

Price / 1M Tokens

Input (Prompt)

$0.3000

Output (Completion)

$0.9000

Cost Calculator

Estimate monthly spend for Z.ai: GLM 4.6V based on your workload.

Input tokens / month

01.0B

Output tokens / month

0500M

Estimated Monthly Cost

$18

25M input + 12M output tokens