VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making

Jake Grigsby ⋅ Yuke Zhu ⋅ Michael Ryoo ⋅ Juan Carlos Niebles

Abstract
