arXiv 2511.02805

MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

By Qianhao Yuan, Jie Lou, et al.

Published 2025-11-04

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Typical search agents concatenate the entire interaction history into the LLM context, preserving information integrity but producing long, noisy contexts, resulting in high computation and memory costs. In contrast, using only the current turn avoids this overhead but discards essential information. This trade-off limits the scalability of search agents. To address this challenge, we propose MemSearcher, an agent w…

View the original paper on arXiv