arXiv 2511.02303

Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation

By Zhiwei Zhang, Xiaomin Li, et al.

Published 2025-11-04

Large Language Models (LLMs) trained with reinforcement learning and verifiable rewards have achieved strong results on complex reasoning tasks. Recent work extends this paradigm to a multi-agent setting, where a meta-thinking agent proposes plans and monitors progress while a reasoning agent executes subtasks through sequential conversational turns. Despite promising performance, we identify a critical limitation:…