Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

45 Results

<<   <   Page 2 of 4   >   >>
Workshop
FacePhi: Lightweight Multimodal Large Language Model for Facial Landmark Emotion Recognition
Hongjin Zhao · Zheyuan Liu · Yang Liu · Zhenyue Qin · Jiaxu Liu · Tom Gedeon
Workshop
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
Yupan Huang · Zaiqiao Meng · Fangyu Liu · Yixuan Su · Nigel Collier · Yutong Lu
Workshop
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Xiangming Gu · Xiaosen Zheng · Tianyu Pang · Chao Du · Qian Liu · Ye Wang · Jing Jiang · Min Lin
Workshop
SELF-IMAGINE: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination
Syeda Nahida Akter · Aman Madaan · Sangwu Lee · Yiming Yang · Eric Nyberg
Workshop
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
Jing Yu Koh · Robert Lo · Lawrence Jang · Vikram Duvvur · Ming Lim · Po-Yu Huang · Graham Neubig · Shuyan Zhou · Ruslan Salakhutdinov · Daniel Fried
Workshop
Sat 2:10 Invited Talk: Building a Multimodal Dataset for African languages
Claytone Sikasote
Workshop
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study
Weihao Tan · Ziluo Ding · Wentao Zhang · Boyu Li · Bohan Zhou · Junpeng Yue · Haochong Xia · Jiechuan Jiang · Longtao Zheng · Xinrun Xu · Yifei Bi · Pengjie Gu · Xinrun Wang · Börje Karlsson · Bo An · Zongqing Lu
Poster
Tue 1:45 DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong · chunrui han · Yuang Peng · Zekun Qi · Zheng Ge · Jinrong Yang · Liang Zhao · Jianjian Sun · Hongyu Zhou · Haoran Wei · Xiangwen Kong · Xiangyu Zhang · Kaisheng Ma · Li Yi
Poster
Thu 7:30 Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Xichen Pan · Li Dong · Shaohan Huang · Zhiliang Peng · Wenhu Chen · Furu Wei
Poster
Tue 7:30 EQA-MX: Embodied Question Answering using Multimodal Expression
Md Mofijul Islam · Alexi Gladstone · Riashat Islam · Tariq Iqbal
Poster
Fri 1:45 Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data
Antonis Antoniades · Yiyi Yu · Joe Canzano · William Wang · Spencer Smith
Poster
Tue 1:45 Deep Generative Clustering with Multimodal Diffusion Variational Autoencoders
Emanuele Palumbo · Laura Manduchi · Sonia Laguna · Daphné Chopard · Julia E Vogt