arXiv 2405.01434

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

By Yupeng Zhou, Daquan Zhou, et al.

Published 2024-05-02

Discussion

Read the public discussion and references gathered around this paper.

For recent diffusion-based generative models, maintaining consistent content across a series of generated images, especially those containing subjects and complex details, presents a significant challenge. In this paper, we propose a new way of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent pretrained diffusi…

View the original paper on arXiv