arXiv 2204.02311
PaLM: Scaling Language Modeling with Pathways
By Aakanksha Chowdhery, Sharan Narang, et al.
Published 2022-04-05
Citation lineage
Review the prior work and downstream research connected to this paper.
Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language mo…