Z AI

GLM-5.1

GLM-5.1 delivers a major leap in coding capability, with the largest gains on long-horizon tasks. Where previous models were built around minute-level interactions, GLM-5.1 can work autonomously and continuously on a single task for more than 8 hours, planning, executing, and iterating on its own output throughout the process to deliver complete, engineering-grade results.

Input / 1M tokens: $1.05
Cached input / 1M tokens: $0.525
Output / 1M tokens: $3.50
Context window: 203K tokens
Provider: Z AI
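
To make the per-token prices concrete, here is a minimal sketch of a cost estimate for a single request. The rates are the ones listed above; the function name and the billing model are assumptions for illustration, in particular that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the regular input rate. Actual billing rules are set by the provider.

```python
# Prices in USD per 1M tokens, copied from the listing above.
INPUT_PER_M = 1.05
CACHED_INPUT_PER_M = 0.525
OUTPUT_PER_M = 3.50


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): cached_tokens is the part of
    input_tokens served from cache and billed at the cached rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens (80K of them cached) plus 2K output tokens.
    print(f"${request_cost(100_000, 2_000, cached_tokens=80_000):.4f}")
```

Under these assumptions the example request comes to about $0.07, and most of that cost still comes from the input side, even with 80% of the prompt cached at half price.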

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec: 43 t/s
Time to first token: 1.22s
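
As a rough illustration of what these two numbers mean together, the sketch below estimates end-to-end response time as time to first token plus generation time at the median throughput. The function name and the linear model are assumptions for illustration: the model ignores variance, prompt length, and load, so treat the result as a ballpark figure only.

```python
# Median figures from the listing above.
TTFT_SECONDS = 1.22            # time to first token
OUTPUT_TOKENS_PER_SEC = 43.0   # streaming throughput


def estimated_response_seconds(output_tokens: int) -> float:
    """Ballpark total latency: first-token delay plus streaming time.

    Assumes throughput stays constant at the median for the whole response.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SEC


if __name__ == "__main__":
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} output tokens: ~{estimated_response_seconds(n):.1f} s")
```

With these medians, a 1,000-token response would take roughly 24 to 25 seconds, almost all of it streaming time rather than first-token delay.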

Benchmarks

Intelligence, coding, and math indexes plus the underlying evaluation scores.

Intelligence Index: 51
Coding Index: 43
Math Index: —
MMLU-Pro: —
GPQA: 86.8%
HLE: 28.0%
LiveCodeBench: —
SciCode: 43.8%
MATH-500: —
AIME: —

Benchmarks via Artificial Analysis