GLM 5.1
GLM-5.1 is an open-weight 744B MoE model (40B active) released under the MIT license. Integrating DeepSeek Sparse Attention, it matches proprietary frontier models on SWE-Bench Pro (58.4%).
Leaderboards
Average Score combining domain-specific Autobench scores; Higher is better
Performance vs. Industry Average
Intelligence
GLM 5.1 is of higher intelligence compared to average (2.9), with an intelligence score of 3.1.
Price
GLM 5.1 is cheaper compared to average ($0.75 per 1M Tokens) with a price of $0.51 per 1M Tokens.
Latency
GLM 5.1 has a higher average latency compared to average (44.25s), with an average latency of 60.30s.
P99 Latency
GLM 5.1 has a higher P99 latency compared to average (126.46s), taking 183.27s to receive the first token at P99 (TTFT).
Context Window
GLM 5.1 has a smaller context window than average (406k tokens), with a context window of 203k tokens.