arXiv 1312.5602

Playing Atari with Deep Reinforcement Learning

By Volodymyr Mnih, Koray Kavukcuoglu, et al.

Published 2013-12-19

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no…

View the original paper on arXiv