ICLR 2024

Poster

Tue 1:45

Multimodal Patient Representation Learning with Missing Modalities and Labels
Zhenbang Wu · Anant Dadu · Nicholas Tustison · Brian Avants · Michael Nalls · Jimeng Sun · Faraz Faghri

Poster

Thu 7:30

On the generalization capacity of neural networks during generic multimodal reasoning
Takuya Ito · Soham Dan · Mattia Rigotti · James Kozloski · Murray Campbell

Poster

Tue 1:45

Emu: Generative Pretraining in Multimodality
Quan Sun · Qiying Yu · Yufeng Cui · Fan Zhang · Xiaosong Zhang · Yueze Wang · Hongcheng Gao · Jingjing Liu · Tiejun Huang · Xinlong Wang

Poster

Wed 1:45

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
Juncheng Li · Kaihang Pan · Zhiqi Ge · Minghe Gao · Wei Ji · Wenqiao Zhang · Tat-Seng Chua · Siliang Tang · Hanwang Zhang · Yueting Zhuang

Affinity Workshop

Thu 7:30

What Does a Visual Formal Analysis of the World's 500 Most Famous Paintings Tell Us About Multimodal LLMs?
Muzi Tao · Saining Xie

Poster

Thu 7:30

Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization
Frederic Koehler · Thuy-Duong Vuong

Poster

Tue 7:30

Grounding Multimodal Large Language Models to the World
Zhiliang Peng · Wenhui Wang · Li Dong · Yaru Hao · Shaohan Huang · Shuming Ma · Qixiang Ye · Furu Wei

Poster

Thu 7:30

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning
Mustafa Shukor · Alexandre Rame · Corentin Dancette · MATTHIEU CORD

Poster

Tue 1:45

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
Paul Liang · Chun Kai Ling · Yun Cheng · Alexander Obolenskiy · Yudong Liu · Rohan Pandey · Alex Wilf · Louis-Philippe Morency · Russ Salakhutdinov

Affinity Workshop

What Does a Visual Formal Analysis of the World's 500 Most Famous Paintings Tell Us About Multimodal LLMs?
Muzi Tao · Saining Xie

Poster

Fri 1:45

Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning
Xiongye Xiao · Gengshuo Liu · Gaurav Gupta · Defu Cao · Shixuan Li · Yaxing Li · Tianqing Fang · Mingxi Cheng · Paul Bogdan

Poster

Tue 7:30

VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
Zihao Zhu · Mingda Zhang · Shaokui Wei · Bingzhe Wu · Baoyuan Wu

Main Navigation

45 Results