arXiv 2510.18234

DeepSeek-OCR: Contexts Optical Compression

By Haoran Wei, Yaofeng Sun, et al.

Published 2025-10-21

We present DeepSeek-OCR as an initial investigation into the feasibility of compressing long contexts via optical 2D mapping. DeepSeek-OCR consists of two components: DeepEncoder and DeepSeek3B-MoE-A570M as the decoder. Specifically, DeepEncoder serves as the core engine, designed to maintain low activations under high-resolution input while achieving high compression ratios to ensure an optimal and manageable numbe…
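The abstract excerpt is cut off before it defines "compression ratio". As a minimal sketch, assume the ratio means the number of text tokens represented by a page divided by the number of vision tokens the encoder emits for that page. The function name and the token counts below are hypothetical illustrations, not figures from the paper.

```python
def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Return text tokens per vision token.

    Assumption: "compression ratio" is text_tokens / vision_tokens.
    The excerpt above is truncated before any definition is given.
    """
    if text_tokens < 0:
        raise ValueError("text_tokens must be non-negative")
    if vision_tokens <= 0:
        raise ValueError("vision_tokens must be positive")
    return text_tokens / vision_tokens


# Hypothetical page: 1000 text tokens rendered into 100 vision tokens.
ratio = compression_ratio(1000, 100)
print(f"compression ratio: {ratio:.1f}x")
```

Under this reading, a higher ratio means each vision token stands in for more text, so the decoder has fewer tokens to process for the same content.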
