ONE MODEL TO TRAIN THEM ALL: HIERARCHICAL SELF-DISTILLATION FOR ENHANCED EARLY LAYER EMBEDDINGS

Andrea Gurioli · Federico Pennino · Joao Monteiro · Maurizio Gabbrielli

Abstract
