arXiv 2511.00617
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering
By Eric Bigelow, Daniel Wurgaft, et al.
Published 2025-11-01
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Large language models (LLMs) can be controlled at inference time through prompts (in-context learning) and internal activations (activation steering). Different accounts have been proposed to explain these methods, yet their common goal of controlling model behavior raises the question of whether these seemingly disparate methodologies can be seen as specific instances of a broader framework. Motivated by this, we dā¦