arXiv 2510.04999

Bridging Text and Video Generation: A Survey

By Nilay Kumar, Priyansh Bhandari, et al.

Published 2025-10-06

Citation lineage

Review the prior work and downstream research connected to this paper.

Text-to-video (T2V) generation technology holds potential to transform multiple domains such as education, marketing, entertainment, and assistive technologies for individuals with visual or reading comprehension challenges, by creating coherent visual content from natural language prompts. From its inception, the field has advanced from adversarial models to diffusion-based models, yielding higher-fidelity, tempora…

View the original paper on arXiv