arXiv 2602.21320

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

By Emre Can Acikgoz, Cheng Qian, et al.

Published 2026-02-24

Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to op…
