arXiv 2510.00060

Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving

By Sheng Yang, Tong Zhan, et al.

Published 2025-09-29

In this work, we reconceptualize autonomous driving as a generalized language and formulate the trajectory planning task as next waypoint prediction. We introduce Max-V1, a novel framework for one-stage end-to-end autonomous driving. Our framework presents a single-pass generation paradigm that aligns with the inherent sequentiality of driving. This approach leverages the generative capacity of the VLM (Vision-Langu…
