Skip to yearly menu bar Skip to main content


Poster

Large Batch Optimization for Deep Learning: Training BERT in 76 minutes

Xiaodan Song ⋅ Jing Li ⋅ Cho-Jui Hsieh ⋅ Yang You ⋅ Srinadh Bhojanapalli ⋅ Jonathan Hseu ⋅ Sashank Reddi ⋅ Kurt Keutzer ⋅ Jim Demmel ⋅ Sanjiv Kumar

Abstract

Chat is not available.