arXiv 2510.17558

The Free Transformer

By François Fleuret

Published 2025-10-20

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

We propose an extension of the decoder Transformer that conditions its generative process on random latent variables which are learned without supervision thanks to a variational procedure. Experimental evaluations show that allowing such a conditioning translates into substantial improvements on downstream tasks.

View the original paper on arXiv