arXiv 2507.05791

GTA1: GUI Test-time Scaling Agent

By Yan Yang, Dongxu Li, et al.

Published 2025-07-08

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Graphical user interface (GUI) agents autonomously complete tasks across platforms ( , Linux) by sequentially decomposing user instructions into action proposals that iteratively interact with visual elements in the evolving environment. However, two main challenges arise: i) planning ( , the action proposal sequence) under expansive action space, where selecting an appropriate plan is non-trivial, as many valid one…

View the original paper on arXiv