Z AI

GLM-5 Turbo

GLM-5 Turbo is a model from Z.ai built for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is optimized for real-world agent workflows with long execution chains, with improvements in complex instruction decomposition, tool use, scheduled and persistent execution, and stability across extended tasks.

Input / 1M tokens: $1.20
Output / 1M tokens: $4.00
Context window: 203K tokens
Provider: Z AI
Cached input / 1M tokens: $0.24
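
The per-token prices above can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the function name `estimate_cost` and its parameters are hypothetical, and it assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate. Actual billing rules may differ.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICE_INPUT = 1.20
PRICE_CACHED_INPUT = 0.24
PRICE_OUTPUT = 4.00
TOKENS_PER_UNIT = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): cached_tokens counts the part of
    input_tokens served from cache, billed at the cached-input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_INPUT
        + cached_tokens * PRICE_CACHED_INPUT
        + output_tokens * PRICE_OUTPUT
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 150,000 input tokens (100,000 of them cached) and 2,000 output tokens.
    cost = estimate_cost(150_000, 2_000, cached_tokens=100_000)
    print(f"${cost:.4f}")  # prints $0.0920
```

In this example the uncached input contributes $0.060, the cached input $0.024, and the output $0.008. Under the assumptions above, caching a long shared prefix is what saves the most in agent workflows that resend the same context on every step.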

Performance

Median streaming throughput and first-token latency, as measured by Artificial Analysis.

Output tokens / sec: 0 t/s
Time to first token: 0.00s

Benchmarks

Intelligence, coding, and math indexes plus the underlying evaluation scores.

Intelligence Index: 47
Coding Index: 37
Math Index: —
MMLU-Pro: —
GPQA: 84.7%
HLE: 25.4%
LiveCodeBench: —
SciCode: 43.6%
MATH-500: —
AIME: —

Benchmarks via Artificial Analysis