arXiv 2602.21320

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

By Emre Can Acikgoz, Cheng Qian, et al.

Published 2026-02-24

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. It often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to op…

View the original paper on arXiv