Skip to yearly menu bar Skip to main content


Poster

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Chenxi Whitehouse · Tianlu Wang · Ping Yu · Xian Li · Jason E Weston · Ilia Kulikov · Swarnadeep Saha

Abstract

Log in and register to view live content