arXiv 2506.15799
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
By Andrew Wagenmaker, Mitsuhiko Nakamoto, et al.
Published 2025-06-18
Robotic control policies learned from human demonstrations have achieved impressive results in many real-world applications. However, in scenarios where initial performance is not satisfactory, as is often the case in novel open-world settings, such behavioral cloning (BC)-learned policies typically require collecting additional human demonstrations to further improve their behavior -- an expensive and time-consumin…