Context Window
202,752
Input tokens
Full-context input โ $0.24
Model Cost Profile
Developer: z-aiยท Tokenizer: Other ยท Quantization: fp8
Canonical ID: z-ai/glm-5v-turbo-20260401
Pricing updated May 16, 2026
GLM-5V-Turbo is Z.aiโs first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Context Window
202,752
Input tokens
Full-context input โ $0.24
Max Output
131,072
Completion tokens
Input Price / 1M
$1.20
Prompt tokens
Output Price / 1M
$4.00
Completion tokens
Top Benchmark
Pending
No benchmark data yet
Price History
Current Input / 1M
$1.20
Current Output / 1M
$4.00
Performance History
Current TPS
0.00
Current Latency
0ms
Uptime
100.0%
| Usage Type | Price / 1M Tokens |
|---|---|
| Input (Prompt) | $1.20 |
| Output (Completion) | $4.00 |
| Cache Read | $0.2400 |
Estimate monthly spend for Z.ai: GLM 5V Turbo based on your workload.
Estimated Monthly Cost
$78
25M input + 12M output tokens
Quick links for cost-down decisions before production rollout.