

Poster

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Javier Rando ⋅ Tony Wang ⋅ Stewart Slocum ⋅ Dmitrii Krasheninnikov ⋅ Usman Anwar ⋅ Micah Carroll ⋅ Xander Davies ⋅ Claudia Shi ⋅ Thomas Gilbert ⋅ Rachel Freedman ⋅ Charbel-Raphael Segerie ⋅ Phillip Christoffersen ⋅ Jacob Pfau ⋅ Tomek Korbak ⋅ Xin Chen ⋅ Lauro Langosco ⋅ Samuel Marks ⋅ Erdem Bıyık ⋅ Dorsa Sadigh ⋅ David Krueger ⋅ Pedro Freire ⋅ Mehul Damani ⋅ Jérémy Scheurer ⋅ David Lindner ⋅ Anca Dragan ⋅ Anand Siththaranjan ⋅ Dylan Hadfield-Menell ⋅ Max Nadeau ⋅ Stephen Casper ⋅ Peter Hase ⋅ Andi Peng ⋅ Eric Michaud
2025 Poster

Abstract
