arXiv 2412.19505
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
By Xiaotao Hu, Wei Yin, et al.
Published 2024-12-27
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Recent successes in autoregressive (AR) generation models, such as the GPT series in natural language processing, have motivated efforts to replicate this success in visual tasks. Some works attempt to extend this approach to autonomous driving by building video-based world models capable of generating realistic future video sequences and predicting ego states. However, prior works tend to produce unsatisfactory res…