arXiv 2510.17558
The Free Transformer
By François Fleuret
Published 2025-10-20
We propose an extension of the decoder Transformer that conditions its generative process on random latent variables, which are learned without supervision through a variational procedure. Experimental evaluations show that this conditioning yields substantial improvements on downstream tasks.
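
For orientation, the generic shape of such a variational procedure can be written as the standard evidence lower bound for a latent-conditioned autoregressive model. This is a textbook sketch, not necessarily the paper's exact objective. The symbols are assumptions introduced here: x is a token sequence of length T, z the latent variable, p_theta the decoder, q_phi an encoder used only during training, and p(z) a fixed prior.

```latex
% Standard variational lower bound (ELBO) on the sequence log-likelihood
\log p_\theta(x) \;\ge\;
  \mathbb{E}_{q_\phi(z \mid x)}\!\left[ \log p_\theta(x \mid z) \right]
  \;-\; D_{\mathrm{KL}}\!\left( q_\phi(z \mid x) \,\|\, p(z) \right)

% Autoregressive factorization of the decoder, conditioned on z
\log p_\theta(x \mid z) \;=\; \sum_{t=1}^{T} \log p_\theta\!\left( x_t \mid x_{<t},\, z \right)
```

Under this formulation, training maximizes the bound with z sampled from the encoder, while generation samples z from the prior and then decodes token by token.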