Skip to yearly menu bar Skip to main content


Blog Track Poster Sat, Apr 25, 2026 • 11:15 AM – 1:45 PM PDT Pavilion 4 P4-#4712

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Bingxiang He ⋅ Zekai Qu ⋅ Zeyuan Liu ⋅ Yinghao Chen ⋅ Yuxin Zuo ⋅ Cheng Qian ⋅ Kaiyan Zhang ⋅ Weize Chen ⋅ Chaojun Xiao ⋅ Ganqu Cui ⋅ Ning Ding ⋅ Zhiyuan Liu

Abstract

Log in and register to view live content