arXiv 2604.10167
Visual Late Chunking: An Empirical Study of Contextual Chunking for Efficient Visual Document Retrieval
By Yibo Yan, Mingdong Ou, et al.
Published 2026-04-11
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Multi-vector models dominate Visual Document Retrieval (VDR) due to their fine-grained matching capabilities, but their high storage and computational costs present a major barrier to practical deployment. In this paper, we propose ColChunk, a plug-and-play framework that introduces multimodal late chunking to construct efficient, contextualized multi-vectors. Unlike existing pruning or fixed-token approaches, ColCh…