arXiv 2505.09388

Qwen3 Technical Report

By An Yang, Anfeng Li, et al.

Published 2025-05-14

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Expert (MoE) architectures, with parameter scales ranging from 0.6 to 235 billion. A key innovation in Qwen3 is the integration of thinking mode (f…

View the original paper on arXiv