arXiv 2511.02303
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
By Zhiwei Zhang, Xiaomin Li, et al.
Published 2025-11-04
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Large Language Models (LLMs) trained with reinforcement learning and verifiable rewards have achieved strong results on complex reasoning tasks. Recent work extends this paradigm to a multi-agent setting, where a meta-thinking agent proposes plans and monitors progress while a reasoning agent executes subtasks through sequential conversational turns. Despite promising performance, we identify a critical limitation:…