arXiv 2511.19936

Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

By Youngseo Kim, Dohyun Kim, et al.

Published 2025-11-25

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Image diffusion models, though originally developed for image generation, implicitly capture rich semantic structures that enable various recognition and localization tasks beyond synthesis. In this work, we investigate their self-attention maps can be reinterpreted as semantic label propagation kernels, providing robust pixel-level correspondences between relevant image regions. Extending this mechanism across fram…

View the original paper on arXiv