arXiv 2505.11709

EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

By Ryan Hoque, Peide Huang, et al.

Published 2025-05-16

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Imitation learning for manipulation has a well-known data scarcity problem. Unlike natural language and 2D computer vision, there is no Internet-scale corpus of data for dexterous manipulation. One appealing option is egocentric human video, a passively scalable data source. However, existing large-scale datasets such as Ego4D do not have native hand pose annotations and do not focus on object manipulation. To this…

View the original paper on arXiv