Skip to yearly menu bar Skip to main content


Poster

WAVE: Learning Unified & Versatile Audio-Visual Embeddings with Multimodal LLM

Changli Tang · Qinfan Xiao · Ke Mei · Tianyi Wang · Fengyun Rao · Chao Zhang

Abstract

Log in and register to view live content