firstbacksecondback
673 Results
Workshop
|
Few-Shot Adaptation of Vision-Language Foundation Models via Dual-Path Inference Ce Zhang · Simon Stepputtis · Katia Sycara · Yaqi Xie |
||
Poster
|
Tue 1:45 |
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models Archiki Prasad · Elias Stengel-Eskin · Mohit Bansal |
|
Poster
|
Fri 7:30 |
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning Juan Rocamonde · Victoriano Montesinos · Elvis Nava · Ethan Perez · David Lindner |
|
Workshop
|
How Far Are We from Intelligent Visual Deductive Reasoning? Yizhe Zhang · He Bai · Ruixiang Zhang · Jiatao Gu · Shuangfei Zhai · Joshua Susskind · Navdeep Jaitly |
||
Poster
|
Thu 7:30 |
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy Simon Ging · Maria A. Bravo · Thomas Brox |
|
Workshop
|
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models Yupan Huang · Zaiqiao Meng · Fangyu Liu · Yixuan Su · Nigel Collier · Yutong Lu |
||
Workshop
|
Pre-training Concept Frequency is predictive of CLIP Zero-shot Performance Vishaal Udandarao · Ameya Prabhu · Philip Torr · Adel Bibi · Samuel Albanie · Matthias Bethge |
||
Poster
|
Fri 7:30 |
Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment Utkarsh Kumar Mall · Cheng Perng Phoo · Meilin Liu · Carl Vondrick · Bharath Hariharan · Kavita Bala |
|
Poster
|
Wed 7:30 |
Tag2Text: Guiding Vision-Language Model via Image Tagging Xinyu Huang · Youcai Zhang · Jinyu Ma · Weiwei Tian · Rui Feng · Yuejie Zhang · Yaqian Li · Yandong Guo · Lei Zhang |
|
Poster
|
Tue 7:30 |
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model Sihan Chen · Xingjian He · Handong Li · Xiaojie Jin · Jiashi Feng · Jing Liu |
|
Poster
|
Thu 1:45 |
Negative Label Guided OOD Detection with Pretrained Vision-Language Models Xue JIANG · Feng Liu · Zhen Fang · Hong Chen · Tongliang Liu · Feng Zheng · Bo Han |
|
Poster
|
Thu 7:30 |
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models Shuai Fu · Shuai Fu · Xiequn Wang · Qiushi Huang · Yu Zhang |