arXiv 2604.11035

Introspective Diffusion Language Models

By Yifan Yu, Yuqing Jian, et al.

Published 2026-04-13

Diffusion language models (DLMs) promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We trace this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do not. We define the introspective acceptance rate, which measures whether a model accepts its previously generated tokens. This reveals why AR training has a structural advantage: c…
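
As a rough illustration of the introspective acceptance rate described above, the sketch below scores a model's own output by re-scoring it and counting the positions where the model still agrees with itself. The exact definition is not given in this excerpt, so everything here is an assumption made for illustration: the function name introspective_acceptance_rate, the input format (one probability vector per generated position), and the greedy rule that a token counts as accepted when it is the top-probability token on re-scoring.

```python
from typing import Sequence


def introspective_acceptance_rate(
    generated: Sequence[int],
    rescored_probs: Sequence[Sequence[float]],
) -> float:
    """Fraction of generated tokens the model still picks when re-scoring them.

    Assumption (not from the paper): a token is "accepted" when it is the
    argmax of the model's re-scored distribution at that position.
    """
    if len(generated) != len(rescored_probs):
        raise ValueError("need one probability vector per generated token")
    if not generated:
        return 0.0

    accepted = 0
    for token, probs in zip(generated, rescored_probs):
        # Index of the highest-probability token at this position.
        top = max(range(len(probs)), key=probs.__getitem__)
        if top == token:
            accepted += 1
    return accepted / len(generated)


# Toy example with a 3-token vocabulary and 4 generated positions.
tokens = [2, 0, 1, 1]
probs = [
    [0.1, 0.2, 0.7],  # top token 2, matches generated token 2
    [0.6, 0.3, 0.1],  # top token 0, matches generated token 0
    [0.5, 0.4, 0.1],  # top token 0, generated token was 1 (rejected)
    [0.2, 0.7, 0.1],  # top token 1, matches generated token 1
]
rate = introspective_acceptance_rate(tokens, probs)
```

Under this reading, a model that always agrees with its own generations scores 1.0; lower values indicate positions where the model, on re-scoring, would have chosen a different token.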