Does Data Contamination Make a Difference? Insights from Intentionally Contaminating Pre-training Data For Language Models

Minhao Jiang ⋅ Ken Liu ⋅ Ming Zhong ⋅ Rylan Schaeffer ⋅ Siru Ouyang ⋅ Jiawei Han ⋅ Sanmi Koyejo

Abstract
