Workshop on Large Language Models for Agents
Xinyun Chen · Xiangru Tang · Di Jin · Devamanyu Hazarika · Daniel Fried · Dawn Song · Shafiq Joty · Meredith Morris
Halle A 8 - 9
Fri 10 May, 11:40 p.m. PDT
This workshop examines agents driven by large language models (LLMs), a topic that has recently sparked intense discussion. Building on recent rapid progress in LLMs, we'll focus on autonomous agents that perform intricate tasks in both real and simulated environments, guided by natural language instructions. What sets these agents apart is their sophisticated use of language prompts, not just as a means of communication but also as a medium for reasoning, a characteristic once thought unique to humans. The workshop specifically aims to discuss the methods, tasks, theories, and risks associated with LLM-driven agents that are capable of using language as a tool for thought and communication.
Schedule
Fri 11:40 p.m. - 11:45 p.m. | Opening Remarks | Xinyun Chen
Fri 11:45 p.m. - 12:15 a.m. | Invited Talk 1 | Denny Zhou
Sat 12:15 a.m. - 12:25 a.m. | Spotlight Presentation 1
Sat 12:30 a.m. - 12:40 a.m. | Spotlight Presentation 2
Sat 12:45 a.m. - 1:15 a.m. | Invited Talk 2 | Luke Zettlemoyer
Sat 1:20 a.m. - 1:40 a.m. | Coffee Break
Sat 1:40 a.m. - 1:50 a.m. | Spotlight Presentation 3
Sat 1:50 a.m. - 2:00 a.m. | Spotlight Presentation 4
Sat 2:00 a.m. - 2:30 a.m. | Invited Talk 3 | Graham Neubig
Sat 2:30 a.m. - 3:15 a.m. | Poster Session 1
Sat 3:15 a.m. - 4:15 a.m. | Lunch Break
Sat 4:15 a.m. - 5:00 a.m. | Panel Discussion | Denny Zhou · Luke Zettlemoyer · Graham Neubig · Tao Yu · Roberta Raileanu · Alexandre Drouin
Sat 5:00 a.m. - 5:10 a.m. | Spotlight Presentation 5
Sat 5:15 a.m. - 5:25 a.m. | Spotlight Presentation 6
Sat 5:30 a.m. - 6:30 a.m. | Poster Session 2
Sat 6:30 a.m. - 7:00 a.m. | Invited Talk 4 | Chelsea Finn
Sat 7:00 a.m. - 7:30 a.m. | Invited Talk 5 | Karthik Narasimhan
Sat 7:30 a.m. - 8:00 a.m. | Invited Talk 6 | Joyce Chai
Towards Unified Alignment Between Agents, Humans, and Environment (Poster) | Zonghan Yang · An Liu · Zijun Liu · Kaiming Liu · Fangzhou Xiong · Yile Wang · Zeyuan Yang · Qingyuan Hu · XinRui Chen · Zhenhe Zhang · Fuwen Luo · Zhicheng Guo · Peng Li · Yang Liu
Self-Training Language Models in Arithmetic Reasoning (Poster) | Marek Kadlčík · Michal Štefánik · Ondrej Sotolar · Vlastimil Martinek
R2E: Turning any Github Repository into a Programming Agent Test Environment (Poster) | Naman Jain · Manish Shetty · Tianjun Zhang · Han · Koushik Sen · Ion Stoica
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs (Poster) | Da Yin · Faeze Brahman · Abhilasha Ravichander · Khyathi Chandu · Kai-Wei Chang · Yejin Choi · Bill Yuchen Lin
LEAGUE++: EMPOWERING CONTINUAL ROBOT LEARNING THROUGH GUIDED SKILL ACQUISITION WITH LARGE LANGUAGE MODELS (Poster) | Zhaoyi Li · Kelin Yu · Shuo Cheng · Danfei Xu
WavCraft: Audio Editing and Generation with Large Language Models (Poster) | Jinhua Liang · Huan Zhang · Haohe Liu · Yin Cao · Qiuqiang Kong · Xubo Liu · Wenwu Wang · Mark Plumbley · Huy Phan · Emmanouil Benetos
SAGE: Bridging Semantic and Actionable Parts for Generalizable Manipulation of Articulated Objects (Poster) | Haoran Geng · Songlin Wei · Congyue Deng · Bokui Shen · He Wang · Leonidas Guibas
Simulating Opinion Dynamics with Networks of LLM-based Agents (Poster) | Yun-Shiuan Chuang · Agam Goyal · Nikunj Harlalka · SIDDHARTH SURESH · Robert Hawkins · Sijia Yang · Dhavan Shah · Junjie Hu · Timothy Rogers
Agents: An Open-source Framework for Autonomous Language Agents (Poster) | Wangchunshu Zhou · Yuchen Jiang · Long Li · Jialong Wu · Tiannan Wang · Shuai Wang · Jiamin Chen · Jintian Zhang · Jing Chen · Xiangru Tang · Peng Cui · Ningyu Zhang · Huajun Chen · Mrinmaya Sachan
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation (Oral) | Qingyun Wu · Gagan Bansal · Jieyu Zhang · Yiran Wu · Beibin Li · Erkang Zhu · Li Jiang · Xiaoyun Zhang · Shaokun Zhang · Jiale Liu · Ahmed H Awadallah · Ryen White · Doug Burger · Chi Wang
The Agent Ohana: Designing Unified Data and Training Pipeline for Effective Agent Learning (Poster) | Jianguo Zhang · Tian Lan · Rithesh Murthy · Zhiwei Liu · Weiran Yao · Juntao Tan · Yihao Feng · Thai Hoang · Tulika Awalgaonkar · Liangwei Yang · Shelby Heinecke · Huan Wang · Juan Carlos Niebles · Silvio Savarese · Caiming Xiong
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning (Poster) | Mohamed Aghzal · Erion Plaku · Ziyu Yao
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design (Poster) | Haohang Li · Yangyang Yu · Zhi Chen · Yuechen Jiang · Yang Li · Denghui Zhang · Rong Liu · Jordan Suchow · Khaldoun Khashanah
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL (Poster) | Yifei Zhou · Andrea Zanette · Jiayi Pan · Aviral Kumar · Sergey Levine
Beyond A*: Better LLM planning via Search Dynamics Bootstrapping (Poster) | Lucas Lehnert · Sainbayar Sukhbaatar · Paul McVay · Michael Rabbat · Yuandong Tian
A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention (Poster) | Junyuan Hong · Wenqing Zheng · Han Meng · Siqi Liang · Anqing Chen · Hiroko Dodge · Jiayu Zhou · Zhangyang Wang
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast (Poster) | Xiangming Gu · Xiaosen Zheng · Tianyu Pang · Chao Du · Qian Liu · Ye Wang · Jing Jiang · Min Lin
Large Language Model Evaluation Via Multi AI Agents: Preliminary results (Poster) | Zeeshan Rasheed · Muhammad Waseem · Kari Systä · Pekka Abrahamsson
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study (Poster) | Weihao Tan · Ziluo Ding · Wentao Zhang · Boyu Li · Bohan Zhou · Junpeng Yue · Haochong Xia · Jiechuan Jiang · Longtao Zheng · Xinrun Xu · Yifei Bi · Pengjie Gu · Xinrun Wang · Börje Karlsson · Bo An · Zongqing Lu
GPT-4V(ision) is a Generalist Web Agent, if Grounded (Poster) | Boyuan Zheng · Boyu Gou · Jihyung Kil · Huan Sun · Yu Su
OpenAgents: An Open Platform for Language Agents in the Wild (Poster) | Tianbao Xie · FAN ZHOU · Zhoujun Cheng · Peng Shi · Luoxuan Weng · Yitao Liu · Toh Hua · Junning Zhao · Qian Liu · Che Liu · Zeyu Liu · Yiheng Xu · Hongjin SU · Dongchan Shin · Caiming Xiong · Tao Yu
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models (Poster) | Yuxuan Kuang · Hai Lin · Meng Jiang
TravelPlanner: A Benchmark for Real-World Planning with Language Agents (Poster) | Jian Xie · Kai Zhang · Jiangjie Chen · Tinghui Zhu · Renze Lou · Yuandong Tian · Yanghua Xiao · Yu Su
Empowering Autonomous Driving with Large Language Models: A Safety Perspective (Poster) | Yixuan Wang · Ruochen Jiao · Simon Zhan · Chengtian Lang · Chao Huang · Zhaoran Wang · Zhuoran Yang · Qi Zhu
REX: Rapid Exploration and eXploitation for AI agents (Poster) | Rithesh Murthy · Shelby Heinecke · Juan Carlos Niebles · Zhiwei Liu · Le Xue · Weiran Yao · Yihao Feng · Zeyuan Chen · Akash Gokul · Devansh Arpit · Ran Xu · Phil Mui · Huan Wang · Caiming Xiong · Silvio Savarese
Towards Natural Language-Driven Industrial Assembly Using Foundation Models (Poster) | Omkar Joglekar · Shir Kozlovsky · Tal Lancewicki · Vladimir Tchuiev · Zohar Feldman · Dotan Di Castro
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception (Poster) | Junyang Wang · Haiyang Xu · Jiabo Ye · Ming Yan · Weizhou Shen · Ji Zhang · Fei Huang · Jitao Sang
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow (Oral) | Wenqi Zhang · Yongliang Shen · Weiming Lu · Yueting Zhuang
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web (Poster) | Hiroki Furuta · Yutaka Matsuo · Aleksandra Faust · Izzeddin Gur
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models (Poster) | Shibo Hao · Yi Gu · Haotian Luo · Tianyang Liu · Xiyan Shao · Xinyuan Wang · Shuhua Xie · Haodi Ma · Adithya Samavedhi · Qiyue Gao · Zhen Wang · Zhiting Hu
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (Poster) | Tongxin Yuan · Zhiwei He · Lingzhong Dong · Yiming Wang · Ruijie Zhao · Tian Xia · Lizhen Xu · Binglin Zhou · Li Fangqi · Zhuosheng Zhang · Rui Wang · Gongshen Liu
LLF-Bench: Benchmark for Interactive Learning from Language Feedback (Poster) | Ching-An Cheng · Andrey Kolobov · Dipendra Kumar Misra · Allen Nie · Adith Swaminathan
LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Game (Poster) | Sahar Abdelnabi · Amr Gomaa · Sarath Sivaprasad · Lea Schönherr · Mario Fritz
Is it Possible to Edit Large Language Models Robustly? (Poster) | Xinbei Ma · Tianjie Ju · Jiyang Qiu · Zhuosheng Zhang · hai zhao · lifeng Liu · Yulong Wang
Agent Instructs Large Language Models to be General Zero-Shot Reasoners (Poster) | Nicholas Crispino · Kyle Montgomery · Fankun Zeng · Dawn Song · Chenguang Wang
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? (Poster) | Alexandre Drouin · Maxime Gasse · Massimo Caccia · Issam Laradji · Manuel Del Verme · Tom Marty · David Vazquez · Nicolas Chapados · Alexandre Lacoste
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration (Poster) | Qiushi Sun · Zhangyue Yin · Xiang Li · Zhiyong Wu · Xipeng Qiu · Lingpeng Kong
ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning (Poster) | Alireza Ghafarollahi · Markus J. Buehler
Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation (Poster) | Zhonghan Zhao · Kewei Chen · Dongxu Guo · Wenhao Chai · Tian Ye · Yanting Zhang · Gaoang Wang
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records (Poster) | Wenqi Shi · Ran Xu · Yuchen Zhuang · Yue Yu · Jieyu Zhang · Hang Wu · Yuanda Zhu · Joyce Ho · Carl Yang · May Dongmei Wang
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models (Poster) | Zhiyuan Hu · Chumin Liu · Xidong Feng · Yilun Zhao · See-Kiong Ng · Anh Tuan Luu · Junxian He · Pang Wei Koh · Bryan Hooi
TaskBench: Benchmarking Large Language Models for Task Automation (Poster) | Yongliang Shen · Kaitao Song · Xu Tan · Wenqi Zhang · Kan Ren · Siyu Yuan · Weiming Lu · Dongsheng Li · Yueting Zhuang
SELF-IMAGINE: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination (Poster) | Syeda Nahida Akter · Aman Madaan · Sangwu Lee · Yiming Yang · Eric Nyberg
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments (Poster) | Yusuf Roohani · Jian Vora · Qian Huang · Percy Liang · Jure Leskovec
MAGIC: INVESTIGATION OF LARGE LANGUAGE MODEL POWERED MULTI-AGENT IN COGNITION, ADAPTABILITY, RATIONALITY AND COLLABORATION (Poster) | Lin Xu · Zhiyuan Hu · Zhou Daquan · Hongyu Ren · Zhen Dong · Kurt Keutzer · See-Kiong Ng · Jiashi Feng
Do LLM Agents Have Regret? A Case Study in Online Learning and Games (Poster) | Chanwoo Park · Xiangyu Liu · Asuman Ozdaglar · Kaiqing Zhang
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science (Poster) | Xiangru Tang · Qiao Jin · Kunlun Zhu · Tongxin Yuan · Yichi Zhang · Wangchunshu Zhou · Meng Qu · Yilun Zhao · Jian Tang · Zhuosheng Zhang · Arman Cohan · Zhiyong Lu · Mark Gerstein
AutoAct: Automatic Agent Learning from Scratch via Self-Planning (Oral) | Shuofei Qiao · Ningyu Zhang · Runnan Fang · Yujie Luo · Wangchunshu Zhou · Yuchen Jiang · chengfei lv · Huajun Chen
Expressing and Exploiting Parallelism in Language Model Decoding (Poster) | Tian Jin · Ellie Cheng · Michael Carbin
Towards Self-Improving Language Models for Code Generation (Poster) | Michaël Defferrard · Corrado Rainone · David Zhang · Blazej Manczak · Natasha Butt · Taco Cohen
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents (Poster) | Yiran Wu · Feiran Jia · Shaokun Zhang · Hangyu Li · Erkang Zhu · Yue Wang · Yin Tat Lee · Richard Peng · Qingyun Wu · Chi Wang
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects (Poster) | Yutaro Yamada · Khyathi Chandu · Bill Yuchen Lin · Jack Hessel · Ilker Yildirim · Yejin Choi
An Embodied Generalist Agent in 3D World (Poster) | Jiangyong Huang · Silong Yong · Xiaojian Ma · Xiongkun Linghu · Puhao Li · Yan Wang · Qing Li · Song-Chun Zhu · Baoxiong Jia · Siyuan Huang
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization (Poster) | Wenqi Zhang · Ke Tang · Hai Wu · Mengna Wang · Yongliang Shen · Guiyang Hou · Zeqi Tan · Peng Li · Yueting Zhuang · Weiming Lu
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement (Poster) | Wonseok Jeon · Mukul Gagrani · Raghavv Goel · Junyoung Park · Mingu Lee · Christopher Lott
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks (Poster) | Jing Yu Koh · Robert Lo · Lawrence Jang · Vikram Duvvur · Ming Lim · Po-Yu Huang · Graham Neubig · Shuyan Zhou · Ruslan Salakhutdinov · Daniel Fried
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models (Poster) | Gabriel Sarch · Sahil Somani · Raghav Kapoor · Michael Tarr · Katerina Fragkiadaki
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach (Poster) | Bin Zhang · Hangyu Mao · Jingqing Ruan · Ying Wen · YANG LI · Shao Zhang · Zhiwei Xu · Dapeng Li · Ziyue Li · Rui Zhao · Lijuan Li · Guoliang Fan
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks (Poster) | Murtaza Dalal · Tarun Chiruvolu · Devendra Chaplot · Ruslan Salakhutdinov
Adapting Uni-Modal Language Models for Dense Multi-Modal Co-Reference Resolution using Parameter Augmentation (Poster) | Samuel Osebe · Prashan Wanigasekara · Thanh Tran · Thomas Gueudre
Preference-Conditioned Language-Guided Abstraction (Poster) | Andi Peng · Andreea Bobu · Belinda Li · Theodore Sumers · Ilia Sucholutsky · Nishanth Kumar · Thomas L. Griffiths · Julie Shah
S-Agent: self-organizing agents in open-ended environment (Poster) | Jiaqi Chen · Yuxian Jiang · Jiachen Lu · Li Zhang
Efficient Human-AI Coordination via Preparatory Language-based Convention (Poster) | Cong Guan · Lichao Zhang · Chunpeng Fan · Yi-Chen Li · Feng Chen · Lihe Li · Yunjia Tian · Lei Yuan · Yang Yu
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents (Poster) | Kanzhi Cheng · Qiushi Sun · Yougang Chu · Fangzhi Xu · Li YanTao · Jianbing Zhang · Zhiyong Wu
The ART of LLM Refinement: Ask, Refine, Trust (Poster) | Kumar Shridhar · Koustuv Sinha · Andrew Cohen · Tianlu Wang · Ping Yu · Ramakanth Pasunuru · Mrinmaya Sachan · Jason E Weston · Asli Celikyilmaz
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code (Poster) | Ziniu Hu
LangProp: A code optimization framework using Large Language Models applied to driving (Poster) | Shu Ishida · Gianluca Corrado · George Fedoseev · Hudson Yeo · Lloyd Russell · Jamie Shotton · Joao F. Henriques · Anthony Hu
FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering (Poster) | Siqi Ping · Yuzhu Mao · Yang Liu · Xiao-Ping Zhang · Wenbo Ding
EcoAssistant: Using LLM Assistants More Affordably and Accurately (Poster) | Jieyu Zhang · Ranjay Krishna · Ahmed H Awadallah · Chi Wang
IntentGPT: Few-shot Intent Discovery with Large Language Models (Poster) | Juan A. Rodriguez · Nicholas Botzer · David Vazquez · Christopher Pal · Marco Pedersoli · Issam Laradji
Large Language Models can Strategically Deceive their Users when Put Under Pressure (Oral) | Jérémy Scheurer · Mikita Balesni · Marius Hobbhahn
Language-guided Skill Learning with Temporal Variational Inference (Poster) | Haotian Fu · Pratyusha Sharma · Elias Stengel-Eskin · George D Konidaris · Nicolas Le Roux · Marc-Alexandre Cote · Eric Yuan
Decision-Oriented Dialogue for Human-AI Collaboration (Poster) | Jessy Lin · Nicholas Tomlin · Jacob Andreas · Jason Eisner
Making Retrieval-Augmented Language Models Robust to Irrelevant Context (Poster) | Ori Yoran · Tomer Wolfson · Ori Ram · Jonathan Berant
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning (Poster) | Xiangru Tang · Anni Zou · Zhuosheng Zhang · Ziming Li · Yilun Zhao · Xingyao Zhang · Arman Cohan · Mark Gerstein
Collaborative LLM-Agents for Editable Driving Scene Simulation (Poster) | Yuxi Wei · Zi Wang · Yifan Lu · Chenxin Xu · Changxing Liu · Hao Zhao · Siheng Chen · Yanfeng Wang
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue (Poster) | Xing Han Lu · Zdeněk Kasner · Siva Reddy
The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents (Poster) | Yun-Shiuan Chuang · Nikunj Harlalka · SIDDHARTH SURESH · Agam Goyal · Robert Hawkins · Sijia Yang · Dhavan Shah · Junjie Hu · Timothy Rogers
BOLAA: BENCHMARKING AND ORCHESTRATING LLM AUTONOMOUS AGENTS (Poster) | Zhiwei Liu · Weiran Yao · Jianguo Zhang · Le Xue · Shelby Heinecke · Rithesh Murthy · Yihao Feng · Zeyuan Chen · Juan Carlos Niebles · Devansh Arpit · Ran Xu · Phil Mui · Huan Wang · Caiming Xiong · Silvio Savarese
Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems (Poster) | Yilun Kong · Jingqing Ruan · YiHong Chen · Bin Zhang · Tianpeng Bao · shi shiwei · du qing · xiaoru hu · Hangyu Mao · Ziyue Li · Xingyu Zeng · Rui Zhao · Xueqian Wang
Executable Code Actions Elicit Better LLM Agents (Oral) | Xingyao Wang · Yangyi Chen · Lifan Yuan · Yizhe Zhang · Yunzhu Li · Hao Peng · Heng Ji
Self-Alignment of Large Language Models via Multi-Agent Social Simulation (Poster) | Xianghe Pang · Shuo Tang · Rui Ye · Yuxin Xiong · Bolun Zhang · Yanfeng Wang · Siheng Chen
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents (Poster) | Ke Yang · Jiateng Liu · John Wu · Chaoqi Yang · Yi Fung · Sha Li · Zixuan Huang · Xu Cao · Xingyao Wang · Heng Ji · ChengXiang Zhai
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent (Poster) | Renat Aksitov · Sobhan Miryoosefi · Zonglin Li · Daliang Li · Sheila Babayan · Kavya Kopparapu · Zachary Fisher · Ruiqi Guo · Sushant Prakash · Pranesh Srinivasan · Manzil Zaheer · Felix Yu · Sanjiv Kumar
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View (Oral) | Jintian Zhang · Xin Xu · Ningyu Zhang · Ruibo Liu · Bryan Hooi · Shumin Deng
Are Machines Better at Slow Thinking? Unveiling Human-Machine Inference Gaps in Entailment Verification (Poster) | Soumya Sanyal · Tianyi Xiao · Jiacheng Liu · Wenya Wang · Xiang Ren
Limitations of Agents Simulated by Predictive Models (Poster) | Raymond Douglas · Jacek Karwowski · Chan Bae · Andis Draguns · Victoria Krakovna
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement (Poster) | Zhiyong Wu · Chengcheng Han · Zichen Ding · Zhenmin Weng · Zhoumianze Liu · Shunyu Yao · Tao Yu · Lingpeng Kong
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction (Poster) | Siyu Yuan · Kaitao Song · Jiangjie Chen · Xu Tan · Yongliang Shen · Kan Ren · Dongsheng Li · Deqing Yang
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets (Poster) | Seonghyeon Ye · Doyoung Kim · Sungdong Kim · Hyeonbin Hwang · Seungone Kim · Yongrae Jo · James Thorne · Juho Kim · Minjoon Seo
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models (Poster) | Andy Zhou · Kai Yan · Michal Shlapentokh-Rothman · Haohan Wang · Yu-Xiong Wang
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent (Poster) | Licheng Wen · Xuemeng Yang · DAOCHENG FU · Xiaofeng Wang · Pinlong Cai · Xin Li · Tao MA · Yingxuan Li · Linran XU · Dengke Shang · Zheng Zhu · Shaoyan Sun · Yeqi BAI · Xinyu Cai · Min Dou · Shuanglu Hu · Botian Shi · Yu Qiao
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA (Poster) | Dhruv Agarwal · Rajarshi Das · Sopan Khosla · Rashmi Gangadharaiah
Open-TI: Open Traffic Intelligence with Augmented Language Model (Poster) | Longchao Da · Kuan-Ru Liou · Tiejin Chen · Xuesong Zhou · Xiangyong Luo · 'YZ' Yezhou Yang · Hua Wei
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents (Poster) | Chang Ma · Junlei Zhang · Zhihao Zhu · Cheng Yang · Yujiu Yang · Yaohui Jin · Zhenzhong Lan · Lingpeng Kong · Junxian He
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts (Poster) | Kuang-Huei Lee · Xinyun Chen · Hiroki Furuta · Ian Fischer