arXiv 1704.01444
Learning to Generate Reviews and Discovering Sentiment
By Alec Radford, Rafal Jozefowicz, et al.
Published 2017-04-05
Citation lineage
Review the prior work and downstream research connected to this paper.
We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of the art on the bin…