arXiv 1704.01444

Learning to Generate Reviews and Discovering Sentiment

By Alec Radford, Rafal Jozefowicz, et al.

Published 2017-04-05

Discussion

Read the public discussion and references gathered around this paper.

We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of the art on the bin…

View the original paper on arXiv