Skip to yearly menu bar Skip to main content


CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

Tianneng Shi ⋅ Robin Rheem ⋅ Dongwei Jiang ⋅ Mona Wang ⋅ Francisco De La Riega ⋅ ZHUN WANG ⋅ Jingzhi Jiang ⋅ Alexander Cheung ⋅ Sean Tai ⋅ Jonah Cha ⋅ Jianhong Tu ⋅ Gabriel Han ⋅ Chenguang Wang ⋅ Wenbo Guo ⋅ Jingxuan He ⋅ Dawn Song

Abstract

Chat is not available.