arXiv 2507.20526

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

By Andy Zou, Maxwell Lin, et al.

Published 2025-07-28

Discussion

Read the public discussion and references gathered around this paper.

Recent advances have enabled LLM-powered AI agents to autonomously execute complex tasks by combining language model reasoning with tools, memory, and web access. But can these systems be trusted to follow deployment policies in realistic environments, especially under attack? To investigate, we ran the largest public red-teaming competition to date, targeting 22 frontier AI agents across 44 realistic deployment sce…

View the original paper on arXiv