arXiv 2505.09388

Qwen3 Technical Report

By An Yang, Anfeng Li, et al.

Published 2025-05-14

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

In this work, we present Qwen3, the latest version of the Qwen model family. Qwen3 comprises a series of large language models (LLMs) designed to advance performance, efficiency, and multilingual capabilities. The Qwen3 series includes models of both dense and Mixture-of-Expert (MoE) architectures, with parameter scales ranging from 0.6 to 235 billion. A key innovation in Qwen3 is the integration of thinking mode (f…

View the original paper on arXiv