Poster

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Kexin Huang · Haoming Meng · Junkang Wu · Jinda Lu · Chiyu Ma · Ziqian Chen · xue wang · Bolin Ding · Jiancan Wu · Xiang Wang · Xiangnan He · Guoyin Wang · Jingren Zhou

Abstract
