arXiv 2511.00617
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
By Eric Bigelow, Daniel Wurgaft, et al.
Published 2025-11-01
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Large language models (LLMs) can be controlled at inference time through prompts (in-context learning) and internal activations (activation steering). Different accounts have been proposed to explain these methods, yet their common goal of controlling model behavior raises the question of whether these seemingly disparate methodologies can be seen as specific instances of a broader framework. Motivated by this, we dā¦