

Search All 2023 Events
 

220 Results

Poster
LMSeg: Language-guided Multi-dataset Segmentation
Qiang Zhou · Yuang Liu · Chaohui Yu · Jingliang Li · Zhibin Wang · Fan Wang
Workshop
Thu 4:00 Modality-Aware Adaptation of Contrastive Language-Image Models
Alexander Long · Thalaiyasingam Ajanthan · Anton Hengel
Poster
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment
Hongwei Xue · Yuchong Sun · Bei Liu · Jianlong Fu · Ruihua Song · Houqiang Li · Jiebo Luo
Workshop
Thu 4:00 Coordinating Multiple Vision-Language Models for Visual Reasoning
Liangyu Chen · Bo Li · Sheng Shen · Jingkang Yang · Chunyuan Li · Kurt Keutzer · Trevor Darrell · Ziwei Liu
Workshop
Thu 4:00 Variational prompt tuning improves generalization of vision-language foundation models
Mohammad Mahdi Derakhshani · Enrique Sanchez · Adrian Bulat · Victor Guilherme Turrisi da Costa · Cees G Snoek · Georgios Tzimiropoulos · Brais Martinez
Poster
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency
Pengzhen Ren · Changlin Li · Hang Xu · Yi Zhu · Guangrun Wang · Jianzhuang Liu · Xiaojun Chang · Xiaodan Liang
Poster
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen · Xiao Wang · Soravit Changpinyo · AJ Piergiovanni · Piotr Padlewski · Daniel Salz · Sebastian Goodman · Adam Grycner · Basil Mustafa · Lucas Beyer · Alexander Kolesnikov · Joan Puigcerver · Nan Ding · Keran Rong · Hassan Akbari · Gaurav Mishra · Linting Xue · Ashish V. Thapliyal · James Bradbury · Weicheng Kuo · Mojtaba Seyedhosseini · Chao Jia · Burcu Karagol Ayan · Carlos Riquelme · Andreas Steiner · Anelia Angelova · Xiaohua Zhai · Neil Houlsby · Radu Soricut
Poster
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention
Shijie Geng · Jianbo Yuan · Yu Tian · Yuxiao Chen · Yongfeng Zhang
Poster
Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus
Gang Li · Yang Li
Poster
Wed 7:30 Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao · Wangchunshu Zhou · Xinsong Zhang · Jiawei Wang
Oral
Wed 1:50 Visual Classification via Description from Large Language Models
Sachit Menon · Carl Vondrick
Poster
Wed 7:30 CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Hao-Wen Dong · Naoya Takahashi · Yuki Mitsufuji · Julian McAuley · Taylor Berg-Kirkpatrick