Skip to yearly menu bar Skip to main content


Does Data Contamination Make a Difference? Insights from Intentionally Contaminating Pre-training Data For Language Models

Minhao Jiang · Ken Liu · Ming Zhong · Rylan Schaeffer · Siru Ouyang · Jiawei Han · Sanmi Koyejo

Abstract

Chat is not available.