arXiv 2510.17558
The Free Transformer
By François Fleuret
Published 2025-10-20
Citation lineage
Review the prior work and downstream research connected to this paper.
We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such a conditioning translates into substantial improvements on downstream tasks.