Skip to yearly menu bar Skip to main content


I Can't Believe It Can't Count: Vision-Language Models Fail at Basic Enumeration Beyond the Subitizing Range

Amirhossein Afsharrad ⋅ Seyed Mousavi ⋅ Sanjay Lall

Abstract

Chat is not available.