Skip to yearly menu bar Skip to main content


Poster

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

Junkai Zhang · Zihao Wang · Lin Gui · Swarnashree Mysore Sathyendra · Jaehwan Jeong · Victor Veitch · Wei Wang · Yunzhong He · Bing Liu · Lifeng Jin

Abstract

Log in and register to view live content