arXiv 2502.00640
CollabLLM: From Passive Responders to Active Collaborators
By Shirley Wu, Michel Galley, et al.
Published 2025-02-02
Citation lineage
Review the prior work and downstream research connected to this paper.
Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multi…