Skip to yearly menu bar Skip to main content


Poster

Squeeze the Soaked Sponge: Efficient Off-policy RFT for Large Language Model

Jing Liang · Hongyao Tang · Yi Ma · Jinyi Liu · YAN ZHENG · Shuyue Hu · LEI BAI · Jianye HAO

Abstract

Log in and register to view live content