Skip to yearly menu bar Skip to main content


Poster

AgentBench: Evaluating LLMs as Agents

Xiao Liu ⋅ Hao Yu ⋅ Hanchen Zhang ⋅ Yifan Xu ⋅ Xuanyu Lei ⋅ Hanyu Lai ⋅ Yu Gu ⋅ Hangliang Ding ⋅ Kaiwen Men ⋅ Kejuan Yang ⋅ Shudan Zhang ⋅ Xiang Deng ⋅ Aohan Zeng ⋅ Zhengxiao Du ⋅ Chenhui Zhang ⋅ Sheng Shen ⋅ Tianjun Zhang ⋅ Yu Su ⋅ Huan Sun ⋅ Minlie Huang ⋅ Yuxiao Dong ⋅ Jie Tang
2024 Poster

Abstract

Video

Chat is not available.