arXiv 2510.22037

ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality

By Shayne Longpre, Sneha Kudugunta, et al.

Published 2025-10-24

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Scaling laws research has focused overwhelmingly on English -- yet the most prominent AI models explicitly serve billions of international users. In this work, we undertake the largest multilingual scaling laws study to date, totaling 774 multilingual training experiments, spanning 10M-8B model parameters, 400+ training languages and 48 evaluation languages. We introduce the Adaptive Transfer Scaling Law (ATLAS) for…

View the original paper on arXiv