arXiv 2412.19505

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

By Xiaotao Hu, Wei Yin, et al.

Published 2024-12-27

Discussion

Read the public discussion and references gathered around this paper.

Recent successes in autoregressive (AR) generation models, such as the GPT series in natural language processing, have motivated efforts to replicate this success in visual tasks. Some works attempt to extend this approach to autonomous driving by building video-based world models capable of generating realistic future video sequences and predicting ego states. However, prior works tend to produce unsatisfactory res…

View the original paper on arXiv