arXiv 2602.21320

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

By Emre Can Acikgoz, Cheng Qian, et al.

Published 2026-02-24

Wiki summary

Large language models (LLMs) are becoming the foundation for autonomous agents that can use tools to solve complex tasks. Reinforcement learning (RL) has emerged as a common approach for injecting such agentic capabilities, but typically under tightly controlled training setups. Such training often depends on carefully constructed task-solution pairs and substantial human supervision, which creates a fundamental obstacle to op…
