Poster Thu, Apr 23, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#5112

FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

Ziyang Fan ⋅ Keyu Chen ⋅ Ruilong Xing ⋅ Yulin Li ⋅ Li Jiang ⋅ Zhuotao Tian

Abstract
