arXiv 1312.5602
Playing Atari with Deep Reinforcement Learning
By Volodymyr Mnih, Koray Kavukcuoglu, et al.
Published 2013-12-19
Wiki summary
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no…
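
The abstract says only that the network is trained with "a variant of Q-learning" and that its output estimates future rewards. As a rough illustration of what that output is regressed toward, the sketch below computes the textbook one-step Q-learning target, r + gamma * max over a' of Q(s', a'). It is not taken from the paper: the function name `q_learning_targets`, the discount `gamma`, and the toy numbers are assumptions made for the example, and the next-state values are plain lists standing in for what a convolutional network would predict from pixels.

```python
def q_learning_targets(rewards, next_q_values, terminal, gamma=0.99):
    """One-step Q-learning targets for a batch of transitions.

    rewards:       list of floats, one reward per transition
    next_q_values: list of lists, estimated Q(s', a') for every action a'
    terminal:      list of bools, True if s' ended the episode
    gamma:         discount factor (assumed value, not from the source)
    """
    targets = []
    for reward, q_next, done in zip(rewards, next_q_values, terminal):
        # Do not bootstrap from the next state once the episode has ended.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


# Toy batch: two transitions, two actions each (hypothetical numbers).
targets = q_learning_targets(
    rewards=[1.0, 0.0],
    next_q_values=[[0.5, 2.0], [1.0, 3.0]],
    terminal=[False, True],
    gamma=0.5,
)
print(targets)
```

In a deep variant, the network's prediction for the action actually taken would be pushed toward these targets with a regression loss. How the paper's variant differs from this textbook form is not stated in the text above.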