arXiv 2412.19505

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

By Xiaotao Hu, Wei Yin, et al.

Published 2024-12-27

Citation lineage

Review the prior work and downstream research connected to this paper.

Recent successes in autoregressive (AR) generation models, such as the GPT series in natural language processing, have motivated efforts to replicate this success in visual tasks. Some works attempt to extend this approach to autonomous driving by building video-based world models capable of generating realistic future video sequences and predicting ego states. However, prior works tend to produce unsatisfactory res…

View the original paper on arXiv