arXiv 2509.19736

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

By Cheng Qian, Zuxin Liu, et al.

Published 2025-09-24

Reinforcement learning (RL) has shown promise in training agentic models that move beyond static benchmarks to engage in dynamic, multi-turn interactions. Yet, the ultimate value of such agents lies in their ability to assist users, a setting where diversity and dynamics of user interaction pose challenges. In this work, we propose UserRL, a unified framework for training and evaluating user-centric abilities throug…
