Skip to yearly menu bar Skip to main content


Blog Post Poster

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Bingxiang He · Zekai Qu · Zeyuan Liu · Yinghao Chen · Yuxin Zuo · Cheng Qian · Kaiyan Zhang · Weize Chen · Chaojun Xiao · Ganqu Cui · Ning Ding · Zhiyuan Liu

Abstract

Log in and register to view live content