arXiv 2509.01322
LongCat-Flash Technical Report
By Meituan LongCat Team, Bayan, et al.
Published 2025-09-01
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities. Stemming from the need for scalable efficiency, LongCat-Flash adopts two novel designs: (a) Zero-computation Experts, which enable dynamic computational budget allocation and activate 18.6B-31.3B parameters (27B on average) per token depending on contextual…
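The idea of a per-token compute budget that varies with routing can be illustrated with a toy sketch. This is not the report's implementation: it assumes that a zero-computation expert behaves as a parameter-free pass-through, and every name and size below (`route_token`, `activated_params`, `N_FFN`, `N_ZERO`, `TOP_K`, the parameter counts) is hypothetical and chosen only for illustration.

```python
import random


def route_token(scores, k):
    """Return the indices of the k highest-scoring experts for one token."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


def activated_params(chosen, n_ffn_experts, params_per_expert, shared_params):
    """Count the parameters touched for one token.

    Assumption: experts with index >= n_ffn_experts are zero-computation
    experts (treated here as pass-throughs), so they contribute no parameters.
    """
    n_real = sum(1 for i in chosen if i < n_ffn_experts)
    return shared_params + n_real * params_per_expert


# Hypothetical toy sizes, not taken from the report.
N_FFN, N_ZERO, TOP_K = 8, 4, 3
PARAMS_PER_EXPERT = 1_000
SHARED_PARAMS = 500

rng = random.Random(0)  # fixed seed so the run is repeatable
counts = []
for _ in range(1000):
    # Random router scores stand in for a learned, context-dependent router.
    scores = [rng.gauss(0.0, 1.0) for _ in range(N_FFN + N_ZERO)]
    chosen = route_token(scores, TOP_K)
    counts.append(
        activated_params(chosen, N_FFN, PARAMS_PER_EXPERT, SHARED_PARAMS)
    )

print("min:", min(counts), "max:", max(counts), "mean:", sum(counts) / len(counts))
```

Because some of the top-k slots can land on zero-computation experts, the activated parameter count differs from token to token while the number of selected experts stays fixed, which is the kind of range-plus-average behavior the abstract reports.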