arXiv 2507.20526
Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition
By Andy Zou, Maxwell Lin, et al.
Published 2025-07-28
Discussion
Read the public discussion and references gathered around this paper.
Recent advances have enabled LLM-powered AI agents to autonomously execute complex tasks by combining language model reasoning with tools, memory, and web access. But can these systems be trusted to follow deployment policies in realistic environments, especially under attack? To investigate, we ran the largest public red-teaming competition to date, targeting 22 frontier AI agents across 44 realistic deployment sce…