Z AI

GLM 4 32B

GLM 4 32B is a cost-effective foundation language model. It handles complex tasks efficiently and has significantly enhanced capabilities in tool use, online search, and code-related tasks. It is made by the same lab behind the THUDM models.

Input / 1M tokens: $0.100
Output / 1M tokens: $0.100
Context window: 128K tokens
Provider: Z AI
Knowledge cutoff: 2024-06-30
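
As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the constant names are hypothetical and chosen only for this example. It assumes billing is strictly linear in token count at the listed rates, which the listing itself does not confirm.

```python
# Listed prices in USD per 1M tokens (taken from the figures above).
INPUT_PRICE_PER_MILLION = 0.100
OUTPUT_PRICE_PER_MILLION = 0.100


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: cost scales linearly with token count at the listed
    rates, with no minimum charge, rounding, or extra fees.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 50,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(50_000, 2_000):.4f}")  # prints $0.0052
```

Because the input and output rates are identical here, the estimate depends only on the total number of tokens processed.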

Performance

Median streaming throughput and first-token latency measured by Artificial Analysis.

Output tokens / sec: —
Time to first token: —