arXiv 2509.01322
LongCat-Flash Technical Report
By Meituan LongCat Team, Bayan, et al.
Published 2025-09-01
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities. Stemming from the need for scalable efficiency, LongCat-Flash adopts two novel designs: (a) Zero-computation Experts, which enables dynamic computational budget allocation and activates 18.6B-31.3B (27B on average) per token depending on contextual…