Skip to yearly menu bar Skip to main content


Virtual presentation / top 25% paper

Learning to Grow Pretrained Models for Efficient Transformer Training

Peihao Wang ⋅ Rameswar Panda ⋅ Lucas Torroba Hennigen ⋅ Philip Greengard ⋅ Leonid Karlinsky ⋅ Rogerio Feris ⋅ David Cox ⋅ Zhangyang Wang ⋅ Yoon Kim

Abstract

Video

Chat is not available.