arXiv 2005.14165
Language Models are Few-Shot Learners
By Tom B. Brown, Benjamin Mann, et al.
Published 2020-05-28
Citation lineage
Review the prior work and downstream research connected to this paper.
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from si…