arXiv 2509.14233
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
By Alejandro Hernández-Cano, Alexander Hägele, et al.
Published 2025-09-17
We present Apertus, a fully open suite of large language models (LLMs) designed to address two systemic shortcomings in today's open model ecosystem: data compliance and multilingual representation. Unlike many prior models that release weights without reproducible data pipelines or regard for content-owner rights, Apertus models are pretrained exclusively on openly available data, retroactively respecting robots.tx…