Skip to yearly menu bar Skip to main content


VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making

Jake Grigsby · Yuke Zhu · Michael Ryoo · Juan Carlos Niebles

Abstract

Chat is not available.