arXiv 2509.19736

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

By Cheng Qian, Zuxin Liu, et al.

Published 2025-09-24

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Reinforcement learning (RL) has shown promise in training agentic models that move beyond static benchmarks to engage in dynamic, multi-turn interactions. Yet, the ultimate value of such agents lies in their ability to assist users, a setting where diversity and dynamics of user interaction pose challenges. In this work, we propose UserRL, a unified framework for training and evaluating user-centric abilities throug…

View the original paper on arXiv