Z AI
GLM 5 Turbo
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool use, scheduled and persistent execution, and overall stability across extended tasks.
- Input / 1M tokens
- $1.20
- Output / 1M tokens
- $4.00
- Context window
- 203K tokens
- Provider
- Z AI
- Cached input / 1M
- $0.240
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec
- 0 t/s
- Time to first token
- 0.00s
Benchmarks
Intelligence, coding, and math indexes plus the underlying evaluation scores.
- Intelligence Index
- 47
- Coding Index
- 37
- Math Index
- —
- MMLU-Pro
- —
- GPQA
- 84.7%
- HLE
- 25.4%
- LiveCodeBench
- —
- SciCode
- 43.6%
- MATH-500
- —
- AIME
- —
Benchmarks via Artificial Analysis