Poster Thu, Apr 23, 2026 • 11:15 AM – 1:45 PM PDT

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Xumeng Wen · Zihan Liu · Shun Zheng · Shengyu Ye · Zhirong Wu · Yang Wang · Zhijian Xu · Xiao Liang · Junjie Li · Ziming Miao · Jiang Bian · Mao Yang

Abstract
