arXiv 2507.05791

GTA1: GUI Test-time Scaling Agent

By Yan Yang, Dongxu Li, et al.

Published 2025-07-08

Citation lineage

Review the prior work and downstream research connected to this paper.

Graphical user interface (GUI) agents autonomously complete tasks across platforms ( , Linux) by sequentially decomposing user instructions into action proposals that iteratively interact with visual elements in the evolving environment. However, two main challenges arise: i) planning ( , the action proposal sequence) under expansive action space, where selecting an appropriate plan is non-trivial, as many valid one…

View the original paper on arXiv