arXiv 2510.27688
Continuous Autoregressive Language Models
By Chenze Shao, Darren Li, et al.
Published 2025-10-31
The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next…
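As a rough illustration of the bandwidth argument only (this is not the paper's method, which the excerpt above does not describe in full), the sketch below counts the sequential generation steps needed when each step yields more than one token's worth of output. The function name `sequential_steps` and the parameter `tokens_per_step` are hypothetical, introduced here for illustration.

```python
def sequential_steps(num_tokens: int, tokens_per_step: int = 1) -> int:
    """Count the sequential generative steps needed to emit num_tokens.

    tokens_per_step is a hypothetical knob standing in for the "semantic
    bandwidth" of one step: 1 corresponds to ordinary token-by-token
    generation, larger values to steps that each carry more content.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    if tokens_per_step < 1:
        raise ValueError("tokens_per_step must be at least 1")
    # Integer ceiling division: a final partial step still costs one step.
    return (num_tokens + tokens_per_step - 1) // tokens_per_step


if __name__ == "__main__":
    length = 1024  # illustrative sequence length, not a figure from the paper
    for k in (1, 2, 4, 8):
        print(f"tokens_per_step={k}: {sequential_steps(length, k)} steps")
```

Under this simplified accounting, the step count falls in proportion to the content carried per step. It ignores any change in per-step cost, so it bounds the potential saving rather than predicting it.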