arXiv 2505.09388
Qwen3 Technical Report
By An Yang, Anfeng Li, et al.
Published 2025-05-14
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Expert (MoE) architectures, with parameter scales ranging from 0.6 to 235 billion. A key innovation in Qwen3 is the integration of thinking mode (f…