Poster Fri, Apr 24, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#4705

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Wei He ⋅ Yueqing Sun ⋅ Hongyan Hao ⋅ Xueyuan Hao ⋅ Zhikang Xia ⋅ Qi GU ⋅ Hui Su ⋅ Xunliang Cai

Abstract
