arXiv 2502.00640
CollabLLM: From Passive Responders to Active Collaborators
By Shirley Wu, Michel Galley, et al.
Published 2025-02-02
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multi…