arXiv 2510.27688

Continuous Autoregressive Language Models

By Chenze Shao, Darren Li, et al.

Published 2025-10-31

The efficiency of large language models (LLMs) is fundamentally limited by their sequential, token-by-token generation process. We argue that overcoming this bottleneck requires a new design axis for LLM scaling: increasing the semantic bandwidth of each generative step. To this end, we introduce Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next…
