arXiv 1412.6980

Adam: A Method for Stochastic Optimization

By Diederik P. Kingma and Jimmy Ba

Published 2014-12-22

We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has low memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is…
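As a rough illustration of what "adaptive estimates of lower-order moments" means in practice, the sketch below keeps a running mean of the gradient (first moment) and of the squared gradient (second moment) for each parameter, corrects both for their zero initialization, and scales each step by their ratio. The update rule and default hyperparameters follow the full paper rather than the truncated abstract above. The names `adam_step` and `minimize_quadratic` and the toy quadratic objective are hypothetical and chosen only for this example.

```python
import math


def adam_step(params, grads, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update to a list of floats.

    t is the 1-based step count. Returns (new_params, new_m, new_v).
    """
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        # Exponential moving averages of the gradient and the squared gradient.
        m_i = beta1 * m_i + (1.0 - beta1) * g
        v_i = beta2 * v_i + (1.0 - beta2) * g * g
        # Bias correction compensates for initializing m and v at zero.
        m_hat = m_i / (1.0 - beta1 ** t)
        v_hat = v_i / (1.0 - beta2 ** t)
        # Per-parameter step: the ratio is unchanged (up to eps) if g is rescaled.
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v


def minimize_quadratic(steps=2000, lr=0.01):
    """Toy problem (an assumption for illustration):
    f(x, y) = (x - 3)^2 + 10 * (y + 1)^2, minimized at (3, -1)."""
    params, m, v = [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]
    for t in range(1, steps + 1):
        x, y = params
        grads = [2.0 * (x - 3.0), 20.0 * (y + 1.0)]
        params, m, v = adam_step(params, grads, m, v, t, lr=lr)
    return params


if __name__ == "__main__":
    print(minimize_quadratic())
```

The only state carried between steps is one `m` and one `v` value per parameter, which is where the low memory requirement comes from. Because the step divides the first-moment estimate by the square root of the second-moment estimate, multiplying a parameter's gradient by a positive constant leaves its update essentially unchanged, apart from the small `eps` term.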
