arXiv 2005.14165
Language Models are Few-Shot Learners
By Tom B. Brown, Benjamin Mann, et al.
Published 2020-05-28
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from si…