Skip to yearly menu bar Skip to main content


Poster

GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning

Xiangxiang Chu · Hailang Huang · Xiao Zhang · Fei Wei · Yong Wang

Abstract

Log in and register to view live content