arXiv 2005.14165

Language Models are Few-Shot Learners

By Tom B. Brown, Benjamin Mann, et al.

Published 2020-05-28

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from si…

View the original paper on arXiv