Bayesian reward models for LLM alignment

Adam Yang · Maxime Robeyns · Thomas Coste · Jun Wang · Haitham Bou Ammar · Laurence Aitchison

Abstract
