arXiv 2403.14753

Learning with SASQuaTCh: a Novel Variational Quantum Transformer Architecture with Kernel-Based Self-Attention

By Ethan N. Evans, Matthew Cook, et al.

Published 2024-03-21

Citation lineage

Review the prior work and downstream research connected to this paper.

The recent exploding growth in size of state-of-the-art machine learning models highlights a well-known issue where exponential parameter growth, which has grown to trillions as in the case of the Generative Pre-trained Transformer (GPT), leads to training time and memory requirements which limit their advancement in the near term. The predominant models use the so-called transformer network and have a large field o…

View the original paper on arXiv