arXiv 2407.21783

The Llama 3 Herd of Models

By Aaron Grattafiori, Abhimanyu Dubey, et al.

Published 2024-07-31

Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama…
