Skip to yearly menu bar Skip to main content


Poster

How reinforcement learning after next-token prediction facilitates learning

Nikolaos Tsilivis · Eran Malach · Karen Ullrich · Julia Kempe

Abstract

Log in and register to view live content