Poster

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding

Dmitry Lepikhin ⋅ HyoukJoong Lee ⋅ Yuanzhong Xu ⋅ Dehao Chen ⋅ Orhan Firat ⋅ Yanping Huang ⋅ Maxim Krikun ⋅ Noam Shazeer ⋅ Zhifeng Chen
2021 Poster
