arXiv 2511.03773

Scaling Agent Learning via Experience Synthesis

By Zhaorun Chen, Zhuokai Zhao, et al.

Published 2025-11-05

Citation lineage

Review the prior work and downstream research connected to this paper.

While reinforcement learning (RL) can empower large language model (LLM) agents by enabling self-improvement through interaction, its practical adoption remains challenging due to costly rollouts, limited task diversity, unreliable reward signals, and infrastructure complexity, all of which obstruct the collection of scalable experience data. To address these challenges, we introduce DreamGym, the first unified fram…

View the original paper on arXiv