Skip to yearly menu bar Skip to main content


Poster

Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models

Yongding Tao · Tian Wang · Yihong Dong · Huanyu Liu · Kechi Zhang · Hu XiaoLong · Ge Li

Abstract

Log in and register to view live content