arXiv 2510.27688

Continuous Autoregressive Language Models

By Chenze Shao, Darren Li, et al.

Published 2025-10-31

Citation lineage

Review the prior work and downstream research connected to this paper.

The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next…

View the original paper on arXiv