Skip to yearly menu bar Skip to main content


Poster

PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

Souradip Chakraborty ⋅ Amrit Bedi ⋅ Alec Koppel ⋅ Huazheng Wang ⋅ Dinesh Manocha ⋅ Mengdi Wang ⋅ Furong Huang
2024 Poster

Abstract

Video

Chat is not available.