Skip to yearly menu bar Skip to main content


Poster

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Jixuan Leng · Chengsong Huang · Banghua Zhu · Jiaxin Huang
2025 Poster

Abstract

Video

Chat is not available.