arXiv 2204.02311

PaLM: Scaling Language Modeling with Pathways

By Aakanksha Chowdhery, Sharan Narang, et al.

Published 2022-04-05

Citation lineage

Review the prior work and downstream research connected to this paper.

Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language mo…

View the original paper on arXiv