arXiv 2506.15799

Steering Your Diffusion Policy with Latent Space Reinforcement Learning

By Andrew Wagenmaker, Mitsuhiko Nakamoto, et al.

Published 2025-06-18

Wiki summary

Robotic control policies learned from human demonstrations have achieved impressive results in many real-world applications. However, in scenarios where initial performance is not satisfactory, as is often the case in novel open-world settings, such behavioral cloning (BC)-learned policies typically require collecting additional human demonstrations to further improve their behavior -- an expensive and time-consumin…
