Skip to yearly menu bar Skip to main content


Are Easier or Harder Examples Better? Rethinking Data Selection for Reward Models and Preference Optimization

Kevin Christian Wibisono ⋅ Aya Ismail ⋅ Pedro O Pinheiro ⋅ Yixin Wang ⋅ Kyunghyun Cho ⋅ Natasa Tagasovska ⋅ Rajesh Ranganath

Abstract

Chat is not available.