arXiv 2505.09388
Qwen3 Technical Report
By An Yang, Anfeng Li, et al.
Published 2025-05-14
In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Experts (MoE) architectures, with parameter scales ranging from 0.6 billion to 235 billion. A key innovation in Qwen3 is the integration of thinking mode (f…