Active Human Feedback Collection via Neural Contextual Dueling Bandits

Arun Verma ⋅ Xiaoqiang Lin ⋅ Zhongxiang Dai ⋅ Daniela Rus ⋅ Bryan Kian Hsiang Low

Abstract
