

Poster

Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Zihe Liu · Jiashun Liu · Yancheng He · Weixun Wang · Jiaheng Liu · Ling Pan · Xinyu Hu · Shaopan Xiong · Ju Huang · Jian Hu · Shengyi Huang · Siran Yang · Jiamang Wang · Wenbo Su · Bo Zheng

Abstract
