arXiv 2407.16833

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

By Zhuowan Li, Cheng Li, et al.

Published 2024-07-23

Retrieval Augmented Generation (RAG) has been a powerful tool for Large Language Models (LLMs) to efficiently process overly lengthy contexts. However, recent LLMs like Gemini-1.5 and GPT-4 show exceptional capabilities to understand long contexts directly. We conduct a comprehensive comparison between RAG and long-context (LC) LLMs, aiming to leverage the strengths of both. We benchmark RAG and LC across various pu…
