arXiv 2510.27688
Continuous Autoregressive Language Models
By Chenze Shao, Darren Li, et al.
Published 2025-10-31
Citation lineage
Review the prior work and downstream research connected to this paper.
The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next…