firstbacksecondback
37 Results
Poster
|
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency Pengzhen Ren · Changlin Li · Hang Xu · Yi Zhu · Guangrun Wang · Jianzhuang Liu · Xiaojun Chang · Xiaodan Liang |
||
Workshop
|
Thu 4:00 |
Coordinating Multiple Vision-Language Models for Visual Reasoning Liangyu Chen · Bo Li · Sheng Shen · Jingkang Yang · Chunyuan Li · Kurt Keutzer · trevor darrell · Ziwei Liu |
|
Poster
|
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval Zhenghao Liu · Chenyan Xiong · Yuanhuiyi Lv · Zhiyuan Liu · Ge Yu |
||
Poster
|
Unified Discrete Diffusion for Simultaneous Vision-Language Generation Minghui HU · Chuanxia Zheng · Zuopeng Yang · Tat-Jen Cham · Heliang Zheng · Chaoyue Wang · Dacheng Tao · Ponnuthurai Suganthan |
||
Workshop
|
Fri 2:06 |
Dynamic Pretraining of Vision-Language Models AJ Piergiovanni · Weicheng Kuo · Wei Li · Anelia Angelova |
|
Poster
|
Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning Zaid Khan · Yun Fu |
||
Poster
|
Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus Gang Li · Yang Li |
||
Workshop
|
Fri 5:19 |
Exploiting Category Names for Few-Shot Classification with Vision-Language Models Taihong Xiao · Zirui Wang · Liangliang Cao · Jiahui Yu · Shengyang Dai · Ming-Hsuan Yang |
|
Poster
|
Wed 7:30 |
Write and Paint: Generative Vision-Language Models are Unified Modal Learners Shizhe Diao · Wangchunshu Zhou · Xinsong Zhang · Jiawei Wang |
|
Workshop
|
Fri 5:10 |
Using Multimodal DNNs to Localize Vision-Language Integration in the Brain Vighnesh Subramaniam · Colin Conwell · Christopher Wang · Gabriel Kreiman · Boris Katz · Ignacio Cases · Andrei Barbu |
|
Workshop
|
Thu 4:00 |
Variational prompt tuning improves generalization of vision-language foundation models Mohammad Mahdi Derakhshani · Enrique Sanchez · Adrian Bulat · Victor Guilherme Turrisi da Costa · Cees G Snoek · Georgios Tzimiropoulos · Brais Martinez |
|
Poster
|
When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It? Mert Yuksekgonul · Federico Bianchi · Ria Kalluri · Dan Jurafsky · James Y Zou |