arXiv 2503.15484

Value Profiles for Encoding Human Variation

By Taylor Sorensen, Pushkar Mishra, et al.

Published 2025-03-19

Discussion

Read the public discussion and references gathered around this paper.

Modelling human variation in rating tasks is crucial for personalization, pluralistic model alignment, and computational social science. We propose representing individuals using natural language value profiles -- descriptions of underlying values compressed from in-context demonstrations -- along with a steerable decoder model that estimates individual ratings from a rater representation. To measure the predictive…

View the original paper on arXiv