Skip to yearly menu bar Skip to main content


Process-then-Retrieve: A Mechanistic Study of Cross-Modal Alignment in Vision-Language Models

Arpita Shanbhag ⋅ Julia Tran ⋅ Dhruv Mandala ⋅ Ayda Sultan

Abstract

Chat is not available.