Poster Thu, Apr 23, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 3 P3-#1412

Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling

Jiachun Li ⋅ Pengfei Cao ⋅ Zhuoran Jin ⋅ Yubo Chen ⋅ Jiexin Xu ⋅ Huaijun Li ⋅ Xiaojian Jiang ⋅ Kang Liu ⋅ Jun Zhao

Abstract
