Skip to yearly menu bar Skip to main content


Memorization Dynamics in Knowledge Distillation for Language Models

Jaydeep Borkar ⋅ Karan Chadha ⋅ Niloofar Mireshghallah ⋅ Yuchen Zhang ⋅ Irina-Elena Veliche ⋅ David Smith ⋅ Zheng Xu ⋅ Diego Garcia-Olano

Abstract

Log in and register to view live content