arXiv 2505.07730

Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction

By Jingfen Qiao, Jia-Huei Ju, et al.

Published 2025-05-12

Discussion

Read the public discussion and references gathered around this paper.

Visual Document Retrieval (VDR) is an emerging research area that focuses on encoding and retrieving document images directly, bypassing the dependence on Optical Character Recognition (OCR) for document search. A recent advance in VDR was introduced by ColPali, which significantly improved retrieval effectiveness through a late interaction mechanism. ColPali's approach demonstrated substantial performance gains ove…

View the original paper on arXiv