arXiv 2511.18659
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
By Jie He, Richard He Bai, et al.
Published 2025-11-24
Citation lineage
Review the prior work and downstream research connected to this paper.
Retrieval-augmented generation (RAG) enhances large language models (LLMs) with external knowledge but still suffers from long contexts and disjoint retrieval-generation optimization. In this work, we propose CLaRa (Continuous Latent Reasoning), a unified framework that performs embedding-based compression and joint optimization in a shared continuous space. To obtain semantically rich and retrievable compressed vec…