Skip to yearly menu bar Skip to main content


Bayesian reward models for LLM alignment

Adam Yang ⋅ Maxime Robeyns ⋅ Thomas Coste ⋅ Jun Wang ⋅ Haitham Bou Ammar ⋅ Laurence Aitchison

Abstract

Chat is not available.