arXiv 2412.19505
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
By Xiaotao Hu, Wei Yin, et al.
Published 2024-12-27
Discussion
Read the public discussion and references gathered around this paper.
Recent successes in autoregressive (AR) generation models, such as the GPT series in natural language processing, have motivated efforts to replicate this success in visual tasks. Some works attempt to extend this approach to autonomous driving by building video-based world models capable of generating realistic future video sequences and predicting ego states. However, prior works tend to produce unsatisfactory res…