arXiv 2205.09707

PLAID: An Efficient Engine for Late Interaction Retrieval

By Keshav Santhanam, Omar Khattab, et al.

Published 2022-05-19

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Pre-trained language models are increasingly important components across multiple information retrieval (IR) paradigms. Late interaction, introduced with the ColBERT model and recently refined in ColBERTv2, is a popular paradigm that holds state-of-the-art status across many benchmarks. To dramatically speed up the search latency of late interaction, we introduce the Performance-optimized Late Interaction Driver (PL…

View the original paper on arXiv