arXiv 2510.00060

Less is More: Lean yet Powerful Vision-Language Model for Autonomous Driving

By Sheng Yang, Tong Zhan, et al.

Published 2025-09-29

Wiki summary

In this work, we reconceptualize autonomous driving as a generalized language and formulate the trajectory planning task as next waypoint prediction. We introduce Max-V1, a novel framework for one-stage end-to-end autonomous driving. Our framework presents a single-pass generation paradigm that aligns with the inherent sequentiality of driving. This approach leverages the generative capacity of the VLM (Vision-Langu…
