arXiv 2510.17558

The Free Transformer

By François Fleuret

Published 2025-10-20

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such a conditioning translates into substantial improvements on downstream tasks.

View the original paper on arXiv