arXiv 2511.10811

Transformers know more than they can tell -- Learning the Collatz sequence

By François Charton and Ashvni Narayanan

Published 2025-11-13

We investigate transformer prediction of long Collatz steps, a complex arithmetic function that maps odd integers to their distant successors in the Collatz sequence ($u_{n+1} = u_n/2$ if $u_n$ is even, $u_{n+1} = (3u_n+1)/2$ if $u_n$ is odd). Model accuracy varies with the base used to encode input and output: it is high for some bases and low for others. Yet, all models, no matter the base, follow a common learning pattern. As training proc…
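To make the setup concrete, here is a minimal Python sketch of the two ingredients the abstract mentions: stepping through the Collatz sequence and writing an integer in a chosen base. It assumes the common "shortcut" form of the map, in which an odd term goes to (3u+1)/2. The helper names `collatz_step`, `next_odd`, and `to_base` are hypothetical. `next_odd` is only one plausible reading of "distant successor" and may not match the paper's exact definition of a long Collatz step.

```python
from typing import List


def collatz_step(u: int) -> int:
    """One step of the sequence: u/2 if u is even, (3u+1)/2 if u is odd (assumed form)."""
    return u // 2 if u % 2 == 0 else (3 * u + 1) // 2


def next_odd(u: int) -> int:
    """Hypothetical helper: starting from an odd u, iterate until the next odd term."""
    if u <= 0 or u % 2 == 0:
        raise ValueError("u must be a positive odd integer")
    u = collatz_step(u)
    while u % 2 == 0:
        u = collatz_step(u)
    return u


def to_base(n: int, base: int) -> List[int]:
    """Digits of a non-negative integer n in the given base, most significant first."""
    if n < 0 or base < 2:
        raise ValueError("n must be non-negative and base at least 2")
    digits = []
    while n > 0:
        digits.append(n % base)
        n //= base
    return digits[::-1] or [0]
```

For example, `to_base(27, 3)` gives `[1, 0, 0, 0]` while `to_base(27, 24)` gives `[1, 3]`: the same input looks very different to a model depending on the base, which is the kind of encoding choice the abstract says affects accuracy.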
