Mistral large 2512

Mistral Large 3 2512 is Mistral's flagship MoE model (675B total, 41B active). It offers top-tier performance in reasoning and coding.

Thinking Mode

Parameters

675B

Context

262,144 tokens

Released

Jan 12, 2025

Leaderboards

Average score combining domain-specific Autobench scores; higher is better

US cents per average answer; lower is better

Average latency in seconds; lower is better

Mistral large 2512 scores below average on intelligence, with a score of 3.9 against an average of 4.1.

Mistral large 2512 is cheaper than average, at $0.51 per 1M tokens against an average of $4.58 per 1M tokens.

Mistral large 2512 has a lower average latency than average, at 89.96s against an average of 116.45s.

Mistral large 2512 has a lower P99 latency than average, taking 198.13s to receive the first token (TTFT) at P99 against an average of 339.37s.

Mistral large 2512 has a smaller context window than average, at 262k tokens against an average of 351k tokens.
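
To put the comparisons above on a common scale, the gap to the leaderboard average can be expressed as a percentage of that average. The following is a minimal sketch, not part of the leaderboard itself: the figures are copied from the statements above, while the names `METRICS`, `relative_difference`, and `summarize` are hypothetical and chosen only for illustration.

```python
# Figures copied from the comparison statements above:
# metric name -> (model value, leaderboard average).
METRICS = {
    "intelligence score": (3.9, 4.1),
    "price (USD per 1M tokens)": (0.51, 4.58),
    "average latency (s)": (89.96, 116.45),
    "P99 TTFT (s)": (198.13, 339.37),
    "context window (k tokens)": (262, 351),
}


def relative_difference(value: float, average: float) -> float:
    """Signed gap to the average, as a percentage of the average."""
    return (value - average) / average * 100.0


def summarize(metrics: dict) -> dict:
    """Map each metric name to its relative difference, rounded to 0.1%."""
    return {
        name: round(relative_difference(value, average), 1)
        for name, (value, average) in metrics.items()
    }


if __name__ == "__main__":
    for name, pct in summarize(METRICS).items():
        print(f"{name}: {pct:+.1f}% vs. average")
```

A negative percentage means the model's value is below the average. Whether that is favorable depends on the metric: lower is better for price and latency, while higher is better for the intelligence score and the context window.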