GLM-5.1

GLM-5.1 is an open-weight 744B-parameter mixture-of-experts (MoE) model with 40B active parameters, released under the MIT license. It integrates DeepSeek Sparse Attention and scores 58.4% on SWE-Bench Pro, on par with proprietary frontier models.

Thinking Mode

Parameters

744B (40B active)

Context

202,752 tokens

Leaderboards

QUALITY

Average Score combining domain-specific Autobench scores; Higher is better

GLM-5.1
3.06

PRICE

USD cent per average answer; Lower is better

GLM-5.1
0.50

LATENCY

Average Latency in Seconds; Lower is better

GLM-5.1
66.00s

Performance vs. Industry Average

Intelligence

GLM-5.1 scores above average on intelligence, with a score of 3.1 against an average of 2.8.

Price

GLM-5.1 is cheaper than average, at $0.50 per 1M tokens against an average of $0.67 per 1M tokens.

Latency

GLM-5.1 is slower than average, with a mean latency of 66.09s against an average of 45.95s.

P99 Latency

GLM-5.1 has a higher P99 latency than average: at the 99th percentile, its time to first token (TTFT) is 197.75s, against an average of 131.50s.

Context Window

GLM-5.1 has a smaller context window than average, at 203k tokens against an average of 401k tokens.
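
To put the comparisons above on a common scale, the gap to the average can be expressed as a percentage of the average. The following is a minimal sketch, not part of the leaderboard itself: the metric labels and the relative_difference helper are hypothetical names chosen for illustration, and the numbers are copied from the comparison text in this section.

```python
# Figures copied from the comparison text: (GLM-5.1 value, stated average).
# The dictionary keys are illustrative labels, not official metric names.
METRICS = {
    "intelligence_score": (3.1, 2.8),
    "price_usd_per_1m_tokens": (0.50, 0.67),
    "avg_latency_s": (66.09, 45.95),
    "p99_ttft_s": (197.75, 131.50),
    "context_window_k_tokens": (203, 401),
}


def relative_difference(value: float, average: float) -> float:
    """Return the signed gap to the average, as a percentage of the average."""
    return (value - average) / average * 100.0


if __name__ == "__main__":
    for name, (value, average) in METRICS.items():
        diff = relative_difference(value, average)
        print(f"{name}: {diff:+.1f}% vs. average")
```

A positive result means the model's value is above the average. Whether that is good depends on the metric: higher is better for the intelligence score and context window, lower is better for price and latency.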