Skip to yearly menu bar Skip to main content


Poster

Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling

Jiachun Li · Pengfei Cao · Zhuoran Jin · Yubo Chen · Jiexin Xu · Huaijun Li · Xiaojian Jiang · Kang Liu · Jun Zhao

Abstract

Log in and register to view live content