Poster Thu, Apr 23, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 4 P4-#3415

GranViT: A Fine-Grained Vision Model For Autoregressive Multimodal Large Language Models

Guanghao Zheng ⋅ Bowen Shi ⋅ Mingxing Xu ⋅ Ruoyu Sun ⋅ Peisen Zhao ⋅ Zhibo Zhang ⋅ Wenrui Dai ⋅ Junni Zou ⋅ Hongkai Xiong ⋅ Xiaopeng Zhang ⋅ Qi Tian

Abstract
