West-of-N: Synthetic Preference Generation for Improved Reward Modeling

Alizée Pace ⋅ Jonathan Mallinson ⋅ Eric Malmi ⋅ Sebastian Krause ⋅ Aliaksei Severyn

Abstract
