arXiv 2508.15144
Mobile-Agent-v3: Fundamental Agents for GUI Automation
By Jiabo Ye, Xi Zhang, et al.
Published 2025-08-21
This paper introduces GUI-Owl, a foundational GUI agent model that achieves state-of-the-art performance among open-source end-to-end models on ten GUI benchmarks across desktop and mobile environments, covering grounding, question answering, planning, decision-making, and procedural knowledge. GUI-Owl-7B achieves 66.4 on AndroidWorld and 29.4 on OSWorld. Building on this, we propose Mobile-Agent-v3, a general-purpo…