arXiv 2511.02805

MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

By Qianhao Yuan, Jie Lou, et al.

Published 2025-11-04

Discussion

Read the public discussion and references gathered around this paper.

Typical search agents concatenate the entire interaction history into the LLM context, preserving information integrity but producing long, noisy contexts, resulting in high computation and memory costs. In contrast, using only the current turn avoids this overhead but discards essential information. This trade-off limits the scalability of search agents. To address this challenge, we propose MemSearcher, an agent w…

View the original paper on arXiv