arXiv 2510.04374

GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks

By Tejal Patwardhan, Rachel Dias, et al.

Published 2025-10-05

We introduce GDPval, a benchmark evaluating AI model capabilities on real-world economically valuable tasks. GDPval covers the majority of U.S. Bureau of Labor Statistics Work Activities for 44 occupations across the top 9 sectors contributing to U.S. GDP (Gross Domestic Product). Tasks are constructed from the representative work of industry professionals with an average of 14 years of experience. We find that fron…
