arXiv 2510.15511

Language Models are Injective and Hence Invertible

By Giorgos Nikolaou, Tommaso Mencattini, et al.

Published 2025-10-17

Transformer components such as non-linear activations and normalization are inherently non-injective, suggesting that different inputs could map to the same output and prevent exact recovery of the input from a model's representations. In this paper, we challenge this view. First, we prove mathematically that transformer language models mapping discrete input sequences to their corresponding sequence of continuous r…
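The opening claim, that individual components such as activations and normalization are non-injective, can be checked directly. The sketch below is only an illustration of that component-level observation, not the paper's proof or method: it assumes ReLU as the activation and a layer normalization without learned scale or bias, and the names `relu`, `layer_norm`, and the sample vectors are hypothetical.

```python
import math

def relu(xs):
    # Element-wise ReLU: every negative entry is mapped to 0.0.
    return [max(0.0, x) for x in xs]

def layer_norm(xs, eps=1e-5):
    # Layer normalization without learned scale/bias (an assumption made
    # here for simplicity): subtract the mean, divide by the std deviation.
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return [(x - mean) / math.sqrt(var + eps) for x in xs]

# Two different inputs that ReLU sends to the same output,
# because they differ only in their negative entries.
a = [-1.0, 2.0, -3.0]
b = [-5.0, 2.0, -0.5]
relu_collision = relu(a) == relu(b)

# Two different inputs that layer normalization sends to (numerically)
# the same output, because they differ only by a constant shift.
u = [1.0, 2.0, 4.0]
v = [x + 10.0 for x in u]
ln_collision = all(
    math.isclose(p, q, abs_tol=1e-9)
    for p, q in zip(layer_norm(u), layer_norm(v))
)

print("ReLU collision:", relu_collision)
print("LayerNorm collision:", ln_collision)
```

Both checks concern isolated components applied to arbitrary real vectors. The abstract's claim is about the full map from discrete input sequences to representations, which this sketch does not test.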
