Skip to yearly menu bar Skip to main content


ONE MODEL TO TRAIN THEM ALL: HIERARCHICAL SELF-DISTILLATION FOR ENHANCED EARLY LAYER EMBEDDINGS

Andrea Gurioli ⋅ Federico Pennino ⋅ Joao Monteiro ⋅ Maurizio Gabbrielli

Abstract

Video

Chat is not available.