Skip to yearly menu bar Skip to main content


Oral

Rethinking Reward Modeling in Preference-based Large Language Model Alignment

Hao Sun · Yunyi Shen · Jean-Francois Ton
2025 Oral

Abstract

Video

Chat is not available.