arXiv 2511.02303
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
By Zhiwei Zhang, Xiaomin Li, et al.
Published 2025-11-04
Discussion
Read the public discussion and references gathered around this paper.
Large Language Models (LLMs) trained with reinforcement learning and verifiable rewards have achieved strong results on complex reasoning tasks. Recent work extends this paradigm to a multi-agent setting, where a meta-thinking agent proposes plans and monitors progress while a reasoning agent executes subtasks through sequential conversational turns. Despite promising performance, we identify a critical limitation:…