arXiv 2508.15144

Mobile-Agent-v3: Fundamental Agents for GUI Automation

By Jiabo Ye, Xi Zhang, et al.

Published 2025-08-21

Citation lineage

Review the prior work and downstream research connected to this paper.

This paper introduces GUI-Owl, a foundational GUI agent model that achieves state-of-the-art performance among open-source end-to-end models on ten GUI benchmarks across desktop and mobile environments, covering grounding, question answering, planning, decision-making, and procedural knowledge. GUI-Owl-7B achieves 66.4 on AndroidWorld and 29.4 on OSWorld. Building on this, we propose Mobile-Agent-v3, a general-purpo…

View the original paper on arXiv