Skip to yearly menu bar Skip to main content


Poster

Self-Aligned Reward: Towards Effective and Efficient Reasoners

Peixuan Han · ADIT KRISHNAN · Gerald Friedland · Jiaxuan You · Luyang Kong

Abstract

Log in and register to view live content