arXiv 2505.03729
Visual Imitation Enables Contextual Humanoid Control
By Arthur Allshire, Hongsuk Choi, et al.
Published 2025-05-06
Discussion
Read the public discussion and references gathered around this paper.
How can we teach humanoids to climb staircases and sit on chairs using the surrounding environment context? Arguably, the simplest way is to just show them-casually capture a human motion video and feed it to humanoids. We introduce VIDEOMIMIC, a real-to-sim-to-real pipeline that mines everyday videos, jointly reconstructs the humans and the environment, and produces whole-body control policies for humanoid robots t…