arXiv 2509.19736
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
By Cheng Qian, Zuxin Liu, et al.
Published 2025-09-24
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Reinforcement learning (RL) has shown promise in training agentic models that move beyond static benchmarks to engage in dynamic, multi-turn interactions. Yet, the ultimate value of such agents lies in their ability to assist users, a setting where diversity and dynamics of user interaction pose challenges. In this work, we propose UserRL, a unified framework for training and evaluating user-centric abilities throug…