arXiv 2208.04464

In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation

By Bolin Lai, Miao Liu, et al.

Published 2022-08-08

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

In this paper, we present the first transformer-based model to address the challenging problem of egocentric gaze estimation. We observe that the connection between the global scene context and local visual information is vital for localizing the gaze fixation from egocentric video frames. To this end, we design the transformer encoder to embed the global context as one additional visual token and further propose a…

View the original paper on arXiv