arXiv 2511.00617

Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering

By Eric Bigelow, Daniel Wurgaft, et al.

Published 2025-11-01

Citation lineage

Review the prior work and downstream research connected to this paper.

Large language models (LLMs) can be controlled at inference time through prompts (in-context learning) and internal activations (activation steering). Different accounts have been proposed to explain these methods, yet their common goal of controlling model behavior raises the question of whether these seemingly disparate methodologies can be seen as specific instances of a broader framework. Motivated by this, we d…

View the original paper on arXiv