arXiv 2512.02556

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

By DeepSeek-AI, Aixin Liu, et al.

Published 2025-12-02

Citation lineage

Review the prior work and downstream research connected to this paper.

We introduce DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance. The key technical breakthroughs of DeepSeek-V3.2 are as follows: (1) DeepSeek Sparse Attention (DSA): We introduce DSA, an efficient attention mechanism that substantially reduces computational complexity while preserving model performance in long-context scenarios. (2) Scalable Reinforcem…

View the original paper on arXiv