Oral • Thu, Apr 23, 2026 • 12:03 PM – 12:13 PM PDT • 202 A/B

FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

Ziyang Fan ⋅ Keyu Chen ⋅ Ruilong Xing ⋅ Yulin Li ⋅ Li Jiang ⋅ Zhuotao Tian

Abstract
