arXiv 2602.21320
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
By Emre Can Acikgoz, Cheng Qian, et al.
Published 2026-02-24
Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to op…