arXiv 2507.05791
GTA1: GUI Test-time Scaling Agent
By Yan Yang, Dongxu Li, et al.
Published 2025-07-08
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Graphical user interface (GUI) agents autonomously complete tasks across platforms ( , Linux) by sequentially decomposing user instructions into action proposals that iteratively interact with visual elements in the evolving environment. However, two main challenges arise: i) planning ( , the action proposal sequence) under expansive action space, where selecting an appropriate plan is non-trivial, as many valid one…