Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Qiyue Gao ⋅ Xinyu Pi ⋅ Kevin Liu ⋅ Junrong Chen ⋅ Ruolan Yang ⋅ Xinqi Huang ⋅ Xinyu Fang ⋅ Lu Sun ⋅ Gautham Kishore ⋅ Bo Ai ⋅ Stone Tao ⋅ Mengyang Liu ⋅ Jiaxi Yang ⋅ Chao-Jung Lai ⋅ Chuanyang Jin ⋅ Jiannan Xiang ⋅ Benhao Huang ⋅ David Danks ⋅ Hao Su ⋅ Tianmin Shu ⋅ Ziqiao Ma ⋅ Lianhui Qin ⋅ Zhiting Hu

Abstract
