Z AI
GLM 5.1
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on a single task for more than 8 hours, autonomously planning, executing, and improving itself throughout the process, ultimately delivering complete, engineering-grade results.
- Input / 1M tokens
- $1.05
- Output / 1M tokens
- $3.50
- Context window
- 203K tokens
- Provider
- Z AI
- Cached input / 1M
- $0.525
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec
- 43 t/s
- Time to first token
- 1.22s
Benchmarks
Intelligence, coding, and math indexes plus the underlying evaluation scores.
- Intelligence Index
- 51
- Coding Index
- 43
- Math Index
- —
- MMLU-Pro
- —
- GPQA
- 86.8%
- HLE
- 28.0%
- LiveCodeBench
- —
- SciCode
- 43.8%
- MATH-500
- —
- AIME
- —
Benchmarks via Artificial Analysis