Skip to yearly menu bar Skip to main content


Teaching Language Models to Critique via Reinforcement Learning

Zhihui Xie ⋅ Jie chen ⋅ Liyu Chen ⋅ Weichao Mao ⋅ Jingjing Xu ⋅ Lingpeng Kong

Abstract

Video

Chat is not available.