arXiv 1312.5602
Playing Atari with Deep Reinforcement Learning
By Volodymyr Mnih, Koray Kavukcuoglu, et al.
Published 2013-12-19
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no…