Skip to yearly menu bar Skip to main content


Poster

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding

Yuanhao Xiong ⋅ Long Zhao ⋅ Boqing Gong ⋅ Ming-Hsuan Yang ⋅ Florian Schroff ⋅ Ting Liu ⋅ Cho-Jui Hsieh ⋅ Liangzhe Yuan
2024 Poster

Abstract

Video

Chat is not available.