Skip to yearly menu bar Skip to main content


Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

Zhi Gao ⋅ Bofei Zhang ⋅ Pengxiang Li ⋅ Xiaojian Ma ⋅ Tao Yuan ⋅ Yue Fan ⋅ Yuwei Wu ⋅ Yunde Jia ⋅ Song-Chun Zhu ⋅ Qing Li

Abstract

Chat is not available.