Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Alexandru Meterez ⋅ Lorenzo Noci ⋅ Thomas Hofmann ⋅ Antonio Orvieto

Abstract
