6 Results
Poster | Wed 14:30 | signSGD with Majority Vote is Communication Efficient and Fault Tolerant | Jeremy Bernstein · Jiawei Zhao · Kamyar Azizzadenesheli · Anima Anandkumar
Poster | Wed 14:30 | AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods | Zhiming Zhou · Qingru Zhang · Guansong Lu · Hongwei Wang · Weinan Zhang · Yong Yu
Poster | Wed 14:30 | Adaptivity of deep ReLU network for learning in Besov and mixed smooth Besov spaces: optimal rate and curse of dimensionality | Taiji Suzuki
Poster | Wed 14:30 | Theoretical Analysis of Auto Rate-Tuning by Batch Normalization | Sanjeev Arora · Zhiyuan Li · Kaifeng Lyu
Poster | Wed 14:30 | Adaptive Gradient Methods with Dynamic Bound of Learning Rate | Liangchen Luo · Yuanhao Xiong · Yan Liu · Xu Sun
Poster | Wed 14:30 | A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation | Akhilesh Deepak Gotmare · Nitish Shirish Keskar · Caiming Xiong · Richard Socher