45 Results
| Type | Time | Title | Authors |
|---|---|---|---|
| Workshop | | FacePhi: Lightweight Multimodal Large Language Model for Facial Landmark Emotion Recognition | Hongjin Zhao · Zheyuan Liu · Yang Liu · Zhenyue Qin · Jiaxu Liu · Tom Gedeon |
| Workshop | | Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models | Yupan Huang · Zaiqiao Meng · Fangyu Liu · Yixuan Su · Nigel Collier · Yutong Lu |
| Workshop | | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu · Xiaosen Zheng · Tianyu Pang · Chao Du · Qian Liu · Ye Wang · Jing Jiang · Min Lin |
| Workshop | | SELF-IMAGINE: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination | Syeda Nahida Akter · Aman Madaan · Sangwu Lee · Yiming Yang · Eric Nyberg |
| Workshop | | VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks | Jing Yu Koh · Robert Lo · Lawrence Jang · Vikram Duvvur · Ming Lim · Po-Yu Huang · Graham Neubig · Shuyan Zhou · Ruslan Salakhutdinov · Daniel Fried |
| Workshop | Sat 2:10 | Invited Talk: Building a Multimodal Dataset for African languages | Claytone Sikasote |
| Workshop | | Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study | Weihao Tan · Ziluo Ding · Wentao Zhang · Boyu Li · Bohan Zhou · Junpeng Yue · Haochong Xia · Jiechuan Jiang · Longtao Zheng · Xinrun Xu · Yifei Bi · Pengjie Gu · Xinrun Wang · Börje Karlsson · Bo An · Zongqing Lu |
| Poster | Tue 1:45 | DreamLLM: Synergistic Multimodal Comprehension and Creation | Runpei Dong · chunrui han · Yuang Peng · Zekun Qi · Zheng Ge · Jinrong Yang · Liang Zhao · Jianjian Sun · Hongyu Zhou · Haoran Wei · Xiangwen Kong · Xiangyu Zhang · Kaisheng Ma · Li Yi |
| Poster | Thu 7:30 | Kosmos-G: Generating Images in Context with Multimodal Large Language Models | Xichen Pan · Li Dong · Shaohan Huang · Zhiliang Peng · Wenhu Chen · Furu Wei |
| Poster | Tue 7:30 | EQA-MX: Embodied Question Answering using Multimodal Expression | Md Mofijul Islam · Alexi Gladstone · Riashat Islam · Tariq Iqbal |
| Poster | Fri 1:45 | Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data | Antonis Antoniades · Yiyi Yu · Joe Canzano · William Wang · Spencer Smith |
| Poster | Tue 1:45 | Deep Generative Clustering with Multimodal Diffusion Variational Autoencoders | Emanuele Palumbo · Laura Manduchi · Sonia Laguna · Daphné Chopard · Julia E Vogt |