arXiv 1803.10122

World Models

By David Ha and Jürgen Schmidhuber

Published 2018-03-27

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

We explore building generative neural network models of popular reinforcement learning environments. Our world model can be trained quickly in an unsupervised manner to learn a compressed spatial and temporal representation of the environment. By using features extracted from the world model as inputs to an agent, we can train a very compact and simple policy that can solve the required task. We can even train our a…

View the original paper on arXiv