MiniMax M2.5

MiniMax M2.5 is a hyper-efficient 230B MoE model (10B active) trained via large-scale RL in 200,000+ environments. It excels in office productivity, outputting at 100 tokens/sec at unprecedented cost efficiency.

Thinking Mode

Parameters

230000000000 B

Context

196,608 tokens

Released

Invalid Date

Leaderboards

QUALITY

Average Score combining domain-specific Autobench scores; Higher is better

Minimax-m2.5
2.72

PRICE

USD cent per average answer; Lower is better

Minimax-m2.5
0.05

LATENCY

Average Latency in Seconds; Lower is better

Minimax-m2.5
85.00s

Performance vs. Industry Average

Intelligence

MiniMax M2.5 is of lower intelligence compared to average (2.8), with an intelligence score of 2.7.

Price

MiniMax M2.5 is cheaper compared to average ($0.67 per 1M Tokens) with a price of $0.05 per 1M Tokens.

Latency

MiniMax M2.5 has a higher average latency compared to average (45.95s), with an average latency of 85.20s.

P99 Latency

MiniMax M2.5 has a higher P99 latency compared to average (131.50s), taking 240.74s to receive the first token at P99 (TTFT).

Context Window

MiniMax M2.5 has a smaller context window than average (401k tokens), with a context window of 197k tokens.