arXiv 2205.09707
PLAID: An Efficient Engine for Late Interaction Retrieval
By Keshav Santhanam, Omar Khattab, et al.
Published 2022-05-19
Discussion
Read the public discussion and references gathered around this paper.
Pre-trained language models are increasingly important components across multiple information retrieval (IR) paradigms. Late interaction, introduced with the ColBERT model and recently refined in ColBERTv2, is a popular paradigm that holds state-of-the-art status across many benchmarks. To dramatically speed up the search latency of late interaction, we introduce the Performance-optimized Late Interaction Driver (PL…