Virtual presentation / poster accept

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment

Hongwei Xue ⋅ Yuchong Sun ⋅ Bei Liu ⋅ Jianlong Fu ⋅ Ruihua Song ⋅ Houqiang Li ⋅ Jiebo Luo

Abstract

