arXiv 2510.01171

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

By Jiayi Zhang, Simon Yu, et al.

Published 2025-10-01

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Post-training alignment often reduces LLM diversity, leading to a phenomenon known as mode collapse. Unlike prior work that attributes this effect to algorithmic limitations, we identify a fundamental, pervasive data-level driver: typicality bias in preference data, whereby annotators systematically favor familiar text as a result of well-established findings in cognitive psychology. We formalize this bias theoretic…

View the original paper on arXiv