arXiv 2503.14858

1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities

By Kevin Wang, Ishaan Javali, et al.

Published 2025-03-19

Scaling up self-supervised learning has driven breakthroughs in language and vision, yet comparable progress has remained elusive in reinforcement learning (RL). In this paper, we study building blocks for self-supervised RL that unlock substantial improvements in scalability, with network depth serving as a critical factor. Whereas most RL papers in recent years have relied on shallow architectures (around 2 - 5 la…
