arXiv 2502.00640

CollabLLM: From Passive Responders to Active Collaborators

By Shirley Wu, Michel Galley, et al.

Published 2025-02-02

Citation lineage

Review the prior work and downstream research connected to this paper.

Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multi…

View the original paper on arXiv