arXiv 2511.00617
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
By Eric Bigelow, Daniel Wurgaft, et al.
Published 2025-11-01
Citation lineage
Review the prior work and downstream research connected to this paper.
Large language models (LLMs) can be controlled at inference time through prompts (in-context learning) and internal activations (activation steering). Different accounts have been proposed to explain these methods, yet their common goal of controlling model behavior raises the question of whether these seemingly disparate methodologies can be seen as specific instances of a broader framework. Motivated by this, we dā¦