Poster Sat, Apr 25, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#4004

Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation

Sayash Kapoor ⋅ Benedikt Stroebl ⋅ Peter Kirgis ⋅ Nitya Nadgir ⋅ Zachary Siegel ⋅ Boyi Wei ⋅ Tianci Xue ⋅ Ziru Chen ⋅ Felix Chen ⋅ Saiteja Utpala ⋅ Franck Ndzomga ⋅ Dheeraj Oruganty ⋅ Sophie Luskin ⋅ Kangheng Liu ⋅ Botao Yu ⋅ Amit Arora ⋅ Dongyoon Hahm ⋅ Harsh Trivedi ⋅ Huan Sun ⋅ Juyong Lee ⋅ Tengjun Jin ⋅ Yifan Mai ⋅ Yifei Zhou ⋅ Yuxuan Zhu ⋅ Rishi Bommasani ⋅ Daniel Kang ⋅ Dawn Song ⋅ Peter Henderson ⋅ Yu Su ⋅ Percy Liang ⋅ Arvind Narayanan

Abstract
