arXiv 2509.14233

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

By Alejandro Hernández-Cano, Alexander Hägele, et al.

Published 2025-09-17

We present Apertus, a fully open suite of large language models (LLMs) designed to address two systemic shortcomings in today's open model ecosystem: data compliance and multilingual representation. Unlike many prior models that release weights without reproducible data pipelines or regard for content-owner rights, Apertus models are pretrained exclusively on openly available data, retroactively respecting robots.tx…
