arXiv 2604.11035
Introspective Diffusion Language Models
By Yifan Yu, Yuqing Jian, et al.
Published 2026-04-13
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We stem this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do not. We define the introspective acceptance rate, which measures whether a model accepts its previously generated tokens. This reveals why AR training has a structural advantage: cā¦