arXiv 2502.00640

CollabLLM: From Passive Responders to Active Collaborators

By Shirley Wu, Michel Galley, et al.

Published 2025-02-02

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multi…

View the original paper on arXiv