

Poster

MixKD: Towards Efficient Distillation of Large-scale Language Models

Kevin Liang ⋅ Weituo Hao ⋅ Dinghan Shen ⋅ Yufan Zhou ⋅ Weizhu Chen ⋅ Changyou Chen ⋅ Lawrence Carin
2021 Poster

Abstract

