arXiv 2502.00640
CollabLLM: From Passive Responders to Active Collaborators
By Shirley Wu, Michel Galley, et al.
Published 2025-02-02
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multi…