arXiv 2511.02303

Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation

By Zhiwei Zhang, Xiaomin Li, et al.

Published 2025-11-04

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Large Language Models (LLMs) trained with reinforcement learning and verifiable rewards have achieved strong results on complex reasoning tasks. Recent work extends this paradigm to a multi-agent setting, where a meta-thinking agent proposes plans and monitors progress while a reasoning agent executes subtasks through sequential conversational turns. Despite promising performance, we identify a critical limitation:…

View the original paper on arXiv