arXiv 2602.10556

LAP: Language-Action Pre-Training Enables Zero-shot Cross-Embodiment Transfer

By Lihan Zha, Asher J. Hancock, et al.

Published 2026-02-11

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

A long-standing goal in robotics is a generalist policy that can be deployed zero-shot on new robot embodiments without per-embodiment adaptation. Despite large-scale multi-embodiment pre-training, existing Vision-Language-Action models (VLAs) remain tightly coupled to their training embodiments and typically require costly fine-tuning. We introduce Language-Action Pre-training (LAP), a simple recipe that represents…

View the original paper on arXiv