Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

17 Results

<<   <   Page 1 of 2   >   >>
Poster
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval
Zhenghao Liu · Chenyan Xiong · Yuanhuiyi Lv · Zhiyuan Liu · Ge Yu
Poster
Wed 2:30 An Extensible Multi-modal Multi-task Object Dataset with Materials
Trevor Standley · Ruohan Gao · Dawn Chen · Jiajun Wu · Silvio Savarese
Poster
Contrastive Audio-Visual Masked Autoencoder
Yuan Gong · Andrew Rouditchenko · Alexander Liu · David Harwath · Leonid Karlinsky · Hilde Kuehne · James R Glass
Poster
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
wei li · Linchao Zhu · Longyin Wen · Yi Yang
Poster
Is a Caption Worth a Thousand Images? A Study on Representation Learning
Shibani Santurkar · Yann Dubois · Rohan Taori · Percy Liang · Tatsunori Hashimoto
Workshop
Fri 5:25 Impossibility of Collective Intelligence
Krikamol Muandet
Poster
Wed 7:30 CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Hao-Wen Dong · Naoya Takahashi · Yuki Mitsufuji · Julian McAuley · Taylor Berg-Kirkpatrick
Workshop
Predicting Density of States via Multi-modal Transformer
Namkyeong Lee · Heewoong Noh · Sungwon Kim · Dongmin Hyun · Gyoung S. Na · Chanyoung Park
Poster
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui HU · Chuanxia Zheng · Zuopeng Yang · Tat-Jen Cham · Heliang Zheng · Chaoyue Wang · Dacheng Tao · Ponnuthurai Suganthan
Poster
Wed 7:30 Diagnosing and Rectifying Vision Models using Language
Yuhui Zhang · Jeff Z. HaoChen · Shih-Cheng Huang · Kuan-Chieh Wang · James Y Zou · Serena Yeung
Poster
Mon 2:30 Masked Vision and Language Modeling for Multi-modal Representation Learning
Gukyeong Kwon · Zhaowei Cai · Avinash Ravichandran · Erhan Bas · Rahul Bhotika · Stefano Soatto