Skip to yearly menu bar Skip to main content


Teaching Language Models to Critique via Reinforcement Learning

Zhihui Xie · Jie chen · Liyu Chen · Weichao Mao · Jingjing Xu · Lingpeng Kong

Abstract

Video

Chat is not available.