

Poster

Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty

Mehul Damani · Isha Puri · Stewart Slocum · Idan Shenfeld · Leshem Choshen · Yoon Kim · Jacob Andreas

Abstract
