arXiv 2510.00060
Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving
By Sheng Yang, Tong Zhan, et al.
Published 2025-09-29
In this work, we reconceptualize autonomous driving as a generalized language and formulate the trajectory planning task as next waypoint prediction. We introduce Max-V1, a novel framework for one-stage end-to-end autonomous driving. Our framework presents a single-pass generation paradigm that aligns with the inherent sequentiality of driving. This approach leverages the generative capacity of the VLM (Vision-Langu…