arXiv 2310.15213

Function Vectors in Large Language Models

By Eric Todd, Millicent L. Li, et al.

Published 2023-10-23

We report the presence of a simple neural mechanism that represents an input-output function as a vector within autoregressive transformer language models (LMs). Using causal mediation analysis on a diverse range of in-context-learning (ICL) tasks, we find that a small number of attention heads transport a compact representation of the demonstrated task, which we call a function vector (FV). FVs are robust to changes i…
