GLM-4.5 Air

Lightweight GLM 4.5 variant optimized for faster inference and lower computational costs

Parameters

N/A

Context

128,000 tokens

Released

Oct 21, 2024

Leaderboards

Average Score combining domain-specific Autobench scores; Higher is better

USD cent per average answer; Lower is better

Average Latency in Seconds; Lower is better

GLM-4.5 Air is of lower intelligence compared to average (4.1), with an intelligence score of 4.0.

GLM-4.5 Air is cheaper compared to average ($0.91 per 1M Tokens) with a price of $0.36 per 1M Tokens.

GLM-4.5 Air has a higher average latency compared to average (45.24s), with an average latency of 68.34s.

GLM-4.5 Air has a higher P99 latency compared to average (172.60s), taking 240.50s to receive the first token at P99 (TTFT).

GLM-4.5 Air has a smaller context window than average (246k tokens), with a context window of 128k tokens.