Skip to yearly menu bar Skip to main content


Poster

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Yukang Chen ⋅ Fuzhao Xue ⋅ Dacheng Li ⋅ Qinghao Hu ⋅ Ligeng Zhu ⋅ Xiuyu Li ⋅ Yunhao Fang ⋅ Haotian Tang ⋅ Shang Yang ⋅ Zhijian Liu ⋅ Ethan He ⋅ Hongxu Yin ⋅ Pavlo Molchanov ⋅ Jan Kautz ⋅ Jim Fan ⋅ Yuke Zhu ⋅ Yao Lu ⋅ Song Han
2025 Poster

Abstract

Video

Chat is not available.