arXiv 2509.19012
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
By Dapeng Zhang, Jing Sun, et al.
Published 2025-09-23
The emergence of Vision Language Action (VLA) models marks a paradigm shift from traditional policy-based control to generalized robotics, reframing Vision Language Models (VLMs) from passive sequence generators into active agents for manipulation and decision-making in complex, dynamic environments. This survey delves into advanced VLA methods, aiming to provide a clear taxonomy and a systematic, comprehensive revi…