Skip to yearly menu bar Skip to main content


Poster

Language Model Self-improvement by Reinforcement Learning Contemplation

Jing-Cheng Pang ⋅ Pengyuan Wang ⋅ Kaiyuan Li ⋅ XiongHui Chen ⋅ Jiacheng Xu ⋅ Zongzhang Zhang ⋅ Yang Yu
2024 Poster

Abstract

Video

Chat is not available.