arXiv 2510.27688
Continuous Autoregressive Language Models
By Chenze Shao, Darren Li, et al.
Published 2025-10-31
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next…