arXiv 2511.09555

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

By Hao Shi, Bin Xie, et al.

Published 2025-11-12

Robotic manipulation requires precise spatial understanding to interact with objects in the real world. Point-based methods suffer from sparse sampling, which loses fine-grained semantics. Image-based methods typically feed RGB and depth into 2D backbones pre-trained on 3D auxiliary tasks, but their entangled semantics and geometry are sensitive to inherent real-world depth noise that disrupts semantic…