Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#5010

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Zhenting Wang ⋅ Qi Chang ⋅ Hemani Patel ⋅ Shashank Biju ⋅ Cheng-En Wu ⋅ Quan Liu ⋅ Aolin Ding ⋅ Alireza Rezazadeh ⋅ Ankit Parag Shah ⋅ Yujia Bao ⋅ Eugene Siow

Abstract

Log in and register to view live content