arXiv 2204.02311

PaLM: Scaling Language Modeling with Pathways

By Aakanksha Chowdhery, Sharan Narang, et al.

Published 2022-04-05

Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language mo…
