arXiv 2510.04374
GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks
By Tejal Patwardhan, Rachel Dias, et al.
Published 2025-10-05
Citation lineage
Review the prior work and downstream research connected to this paper.
We introduce GDPval, a benchmark evaluating AI model capabilities on real-world economically valuable tasks. GDPval covers the majority of U.S. Bureau of Labor Statistics Work Activities for 44 occupations across the top 9 sectors contributing to U.S. GDP (Gross Domestic Product). Tasks are constructed from the representative work of industry professionals with an average of 14 years of experience. We find that fron…