arXiv 2509.01322

LongCat-Flash Technical Report

By Meituan LongCat Team, Bayan, et al.

Published 2025-09-01

We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities. Stemming from the need for scalable efficiency, LongCat-Flash adopts two novel designs: (a) Zero-computation Experts, which enable dynamic computational budget allocation and activate 18.6B-31.3B parameters (27B on average) per token depending on contextual…
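To make the idea of a per-token computational budget concrete, here is a minimal NumPy sketch of top-k routing over a pool that mixes ordinary experts with zero-computation experts. It is an illustration under stated assumptions, not the report's implementation: the function name `route_with_zero_experts`, the single-matrix `tanh` experts, the softmax router, and the choice to model a zero-computation expert as an identity mapping are all assumptions made for this example. The point it shows is that when a token's top-k selection lands on zero-computation slots, fewer real experts run for that token, so the activated compute varies from token to token.

```python
import numpy as np

def route_with_zero_experts(x, router_w, expert_ws, top_k):
    """Toy MoE layer with zero-computation experts (all names hypothetical).

    x:         (tokens, d) token representations
    router_w:  (d, n_real + n_zero) router weights
    expert_ws: list of n_real (d, d) matrices, one per real expert
    Router columns past len(expert_ws) are zero-computation experts,
    assumed here to return their input unchanged.
    """
    n_real = len(expert_ws)
    logits = x @ router_w
    logits = logits - logits.max(axis=1, keepdims=True)  # numerically stable softmax
    probs = np.exp(logits)
    probs /= probs.sum(axis=1, keepdims=True)
    top = np.argsort(-probs, axis=1)[:, :top_k]  # top-k expert ids per token

    out = np.zeros_like(x)
    real_used = np.zeros(x.shape[0], dtype=int)  # real experts run per token
    for t in range(x.shape[0]):
        for e in top[t]:
            if e < n_real:
                # Real expert: costs a matrix multiply.
                out[t] += probs[t, e] * np.tanh(x[t] @ expert_ws[e])
                real_used[t] += 1
            else:
                # Zero-computation expert: identity, no expert weights touched.
                out[t] += probs[t, e] * x[t]
    return out, real_used

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d, n_real, n_zero, top_k = 8, 4, 2, 2
    x = rng.normal(size=(5, d))
    router_w = rng.normal(size=(d, n_real + n_zero))
    expert_ws = [rng.normal(size=(d, d)) for _ in range(n_real)]
    _, real_used = route_with_zero_experts(x, router_w, expert_ws, top_k)
    print("real experts activated per token:", real_used)
```

In this toy setup the count in `real_used` plays the role of the variable activated-parameter budget described in the abstract: tokens routed mostly to zero-computation slots consume less compute than tokens routed to real experts.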
