

Poster

PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

Souradip Chakraborty · Amrit Bedi · Alec Koppel · Huazheng Wang · Dinesh Manocha · Mengdi Wang · Furong Huang
2024 Poster


