

Search All 2024 Events
 

45 Results

Poster
Tue 1:45 Multimodal Patient Representation Learning with Missing Modalities and Labels
Zhenbang Wu · Anant Dadu · Nicholas Tustison · Brian Avants · Michael Nalls · Jimeng Sun · Faraz Faghri
Poster
Thu 7:30 On the generalization capacity of neural networks during generic multimodal reasoning
Takuya Ito · Soham Dan · Mattia Rigotti · James Kozloski · Murray Campbell
Poster
Tue 1:45 Emu: Generative Pretraining in Multimodality
Quan Sun · Qiying Yu · Yufeng Cui · Fan Zhang · Xiaosong Zhang · Yueze Wang · Hongcheng Gao · Jingjing Liu · Tiejun Huang · Xinlong Wang
Poster
Wed 1:45 Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
Juncheng Li · Kaihang Pan · Zhiqi Ge · Minghe Gao · Wei Ji · Wenqiao Zhang · Tat-Seng Chua · Siliang Tang · Hanwang Zhang · Yueting Zhuang
Affinity Workshop
Thu 7:30 What Does a Visual Formal Analysis of the World's 500 Most Famous Paintings Tell Us About Multimodal LLMs?
Muzi Tao · Saining Xie
Poster
Thu 7:30 Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization
Frederic Koehler · Thuy-Duong Vuong
Poster
Tue 7:30 Grounding Multimodal Large Language Models to the World
Zhiliang Peng · Wenhui Wang · Li Dong · Yaru Hao · Shaohan Huang · Shuming Ma · Qixiang Ye · Furu Wei
Poster
Thu 7:30 Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning
Mustafa Shukor · Alexandre Rame · Corentin Dancette · Matthieu Cord
Poster
Tue 1:45 Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
Paul Liang · Chun Kai Ling · Yun Cheng · Alexander Obolenskiy · Yudong Liu · Rohan Pandey · Alex Wilf · Louis-Philippe Morency · Russ Salakhutdinov
Poster
Fri 1:45 Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning
Xiongye Xiao · Gengshuo Liu · Gaurav Gupta · Defu Cao · Shixuan Li · Yaxing Li · Tianqing Fang · Mingxi Cheng · Paul Bogdan
Poster
Tue 7:30 VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
Zihao Zhu · Mingda Zhang · Shaokui Wei · Bingzhe Wu · Baoyuan Wu