arXiv 2208.04464
In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation
By Bolin Lai, Miao Liu, et al.
Published 2022-08-08
In this paper, we present the first transformer-based model to address the challenging problem of egocentric gaze estimation. We observe that the connection between the global scene context and local visual information is vital for localizing the gaze fixation from egocentric video frames. To this end, we design the transformer encoder to embed the global context as one additional visual token and further propose a…