arXiv 2604.11035

Introspective Diffusion Language Models

By Yifan Yu, Yuqing Jian, et al.

Published 2026-04-13

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Diffusion language models promise parallel generation, yet still lag behind autoregressive (AR) models in quality. We stem this gap to a failure of introspective consistency: AR models agree with their own generations, while DLMs often do not. We define the introspective acceptance rate, which measures whether a model accepts its previously generated tokens. This reveals why AR training has a structural advantage: c…

View the original paper on arXiv