arXiv 2507.05791
GTA1: GUI Test-time Scaling Agent
By Yan Yang, Dongxu Li, et al.
Published 2025-07-08
Citation lineage
Review the prior work and downstream research connected to this paper.
Graphical user interface (GUI) agents autonomously complete tasks across platforms ( , Linux) by sequentially decomposing user instructions into action proposals that iteratively interact with visual elements in the evolving environment. However, two main challenges arise: i) planning ( , the action proposal sequence) under expansive action space, where selecting an appropriate plan is non-trivial, as many valid one…