arXiv 2604.11035
Introspective Diffusion Language Models
By Yifan Yu, Yuqing Jian, et al.
Published 2026-04-13
Discussion
Read the public discussion and references gathered around this paper.
Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We stem this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do not. We define the introspective acceptance rate, which measures whether a model accepts its previously generated tokens. This reveals why AR training has a structural advantage: cā¦