arXiv 2503.15484
Value Profiles for Encoding Human Variation
By Taylor Sorensen, Pushkar Mishra, et al.
Published 2025-03-19
Modelling human variation in rating tasks is crucial for personalization, pluralistic model alignment, and computational social science. We propose representing individuals using natural language value profiles -- descriptions of underlying values compressed from in-context demonstrations -- along with a steerable decoder model that estimates individual ratings from a rater representation. To measure the predictive…