Skip to yearly menu bar Skip to main content


Contributed Talk 2: Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu ⋅ Chaoyue Liu ⋅ Adityanarayanan Radhakrishnan ⋅ Misha Belkin

Abstract

Video

Chat is not available.