arXiv 2511.02805
MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
By Qianhao Yuan, Jie Lou, et al.
Published 2025-11-04
Typical search agents concatenate the entire interaction history into the LLM context. This preserves information integrity but produces long, noisy contexts, which in turn incur high computation and memory costs. In contrast, using only the current turn avoids this overhead but discards essential information. This trade-off limits the scalability of search agents. To address this challenge, we propose MemSearcher, an agent w…
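The trade-off between the two baseline strategies described above (full-history concatenation versus current-turn-only context) can be illustrated with a minimal sketch. It does not show MemSearcher's own method, which the truncated summary does not describe. The names `Turn`, `full_history_context`, `current_turn_context`, and `context_lengths` are hypothetical, and whitespace-separated words stand in for LLM tokens as a rough size proxy.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """One agent turn: the search query issued and the text returned (hypothetical type)."""
    query: str
    observation: str


def _render(index: int, turn: Turn) -> str:
    # Format a single turn as plain text for the prompt.
    return f"[turn {index}] search: {turn.query}\nresult: {turn.observation}"


def full_history_context(question: str, turns: list[Turn]) -> str:
    """Concatenate every past turn: nothing is lost, but the context grows with each turn."""
    parts = [f"Question: {question}"]
    parts += [_render(i, t) for i, t in enumerate(turns, start=1)]
    return "\n".join(parts)


def current_turn_context(question: str, turns: list[Turn]) -> str:
    """Keep only the latest turn: the context stays short, but earlier results are dropped."""
    parts = [f"Question: {question}"]
    if turns:
        parts.append(_render(len(turns), turns[-1]))
    return "\n".join(parts)


def context_lengths(question: str, turns: list[Turn]) -> list[tuple[int, int]]:
    """Return (full_history, current_turn) sizes after each turn, in whitespace-separated words."""
    sizes = []
    for n in range(1, len(turns) + 1):
        full = len(full_history_context(question, turns[:n]).split())
        current = len(current_turn_context(question, turns[:n]).split())
        sizes.append((full, current))
    return sizes


if __name__ == "__main__":
    demo = [
        Turn("q1", "alpha fact"),
        Turn("q2", "beta fact"),
        Turn("q3", "gamma fact"),
    ]
    for step, (full, current) in enumerate(context_lengths("Q?", demo), start=1):
        print(f"after turn {step}: full history = {full} words, current turn only = {current} words")
```

In this toy setup the full-history context grows with every turn while the current-turn context stays at a fixed size, but only the full-history context still contains the first turn's result.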