arXiv 1312.5602

Playing Atari with Deep Reinforcement Learning

By Volodymyr Mnih, Koray Kavukcuoglu, et al.

Published 2013-12-19

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no…

View the original paper on arXiv