arXiv 1607.06450

Layer Normalization

By Jimmy Lei Ba, Jamie Ryan Kiros, et al.

Published 2016-07-21

Training state-of-the-art deep neural networks is computationally expensive. One way to reduce the training time is to normalize the activities of the neurons. A recently introduced technique called batch normalization uses the distribution of the summed input to a neuron over a mini-batch of training cases to compute a mean and variance which are then used to normalize the summed input to that neuron on each train…
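
The batch normalization step described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the paper's implementation: the function name `batch_normalize`, the list-of-lists layout (one row per training case, one column per neuron), and the small `eps` stabilizer are assumptions made for illustration, and the learned gain and bias that usually follow the normalization are left out.

```python
import math


def batch_normalize(summed_inputs, eps=1e-5):
    """Normalize each neuron's summed input over a mini-batch.

    summed_inputs: list of training cases, each a list holding one summed
    input per neuron (hypothetical layout chosen for this sketch).
    eps: small constant added to the variance to avoid division by zero
    (an assumption, not taken from the text above).
    """
    batch_size = len(summed_inputs)
    num_neurons = len(summed_inputs[0])
    normalized = [[0.0] * num_neurons for _ in range(batch_size)]

    for j in range(num_neurons):
        # Gather neuron j's summed input across all cases in the mini-batch.
        column = [case[j] for case in summed_inputs]
        mean = sum(column) / batch_size
        variance = sum((a - mean) ** 2 for a in column) / batch_size
        std = math.sqrt(variance + eps)
        # Normalize neuron j's summed input for every training case.
        for i in range(batch_size):
            normalized[i][j] = (column[i] - mean) / std

    return normalized


if __name__ == "__main__":
    mini_batch = [[1.0, 10.0], [3.0, 30.0]]  # 2 cases, 2 neurons
    print(batch_normalize(mini_batch))
```

Because the statistics are taken per neuron across the cases, each neuron's normalized value for one training case depends on the other cases that happen to share its mini-batch.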
