The Delta Learning Hypothesis: Preference Tuning on Weak Data Can Yield Strong Gains

Scott Geng ⋅ Hamish Ivison ⋅ Chun-Liang Li ⋅ Maarten Sap ⋅ Jerry Li ⋅ Ranjay Krishna ⋅ Pang Wei Koh

Abstract
