arXiv 2508.09494

Learning Facts at Scale with Active Reading

By Jessy Lin, Vincent-Pierre Berges, et al.

Published 2025-08-13

Discussion

Read the public discussion and references gathered around this paper.

LLMs are known to store vast amounts of knowledge in their parametric memory. However, learning and recalling facts from this memory is known to be unreliable, depending largely on the prevalence of particular facts in the training data and other factors which are poorly understood. Practitioners are lacking tools which will allow them to ensure that the models learn a given body of knowledge reliably and consistent…

View the original paper on arXiv