arXiv 2310.04363
Amortizing intractable inference in large language models
By Edward J. Hu, Moksh Jain, et al.
Published 2023-10-06
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Autoregressive large language models (LLMs) compress knowledge from their training data through next-token conditional distributions. This limits tractable querying of this knowledge to start-to-end autoregressive sampling. However, many tasks of interest -- including sequence continuation, infilling, and other forms of constrained generation -- involve sampling from intractable posterior distributions. We address t…