arXiv 2510.04374

GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks

By Tejal Patwardhan, Rachel Dias, et al.

Published 2025-10-05

We introduce GDPval, a benchmark evaluating AI model capabilities on real-world economically valuable tasks. GDPval covers the majority of U.S. Bureau of Labor Statistics Work Activities for 44 occupations across the top 9 sectors contributing to U.S. GDP (Gross Domestic Product). Tasks are constructed from the representative work of industry professionals with an average of 14 years of experience. We find that fron…
