Back to Models

Deepseek v3.2

DeepSeek V3.2 is the latest direct DeepSeek model featuring DeepSeek Sparse Attention (DSA) for high efficiency. It delivers long-context handling up to 163k tokens with reduced inference costs.

Parameters
685000000000 B
Context
163,840 tokens
Released
Jan 12, 2025

Leaderboards

Performance vs. Industry Average

Intelligence

Deepseek v3.2 is of higher intelligence compared to average (4.1), with an intelligence score of 4.1.

Price

Deepseek v3.2 is cheaper compared to average ($4.91 per 1M Tokens) with a price of $0.09 per 1M Tokens.

Latency

Deepseek v3.2 has a higher average latency compared to average (120.77s), with an average latency of 124.57s.

P99 Latency

Deepseek v3.2 has a higher P99 latency compared to average (354.03s), taking 410.46s to receive the first token at P99 (TTFT).

Context Window

Deepseek v3.2 has a smaller context window than average (347k tokens), with a context window of 164k tokens.

Deepseek v3.2 - AutoBench