9 Results
Poster | Wed 14:30 | G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space | Qi Meng · Shuxin Zheng · Huishuai Zhang · Wei Chen · Qiwei Ye · Zhi-Ming Ma · Nenghai Yu · Tie-Yan Liu
Poster | Wed 14:30 | Adaptive Gradient Methods with Dynamic Bound of Learning Rate | Liangchen Luo · Yuanhao Xiong · Yan Liu · Xu Sun
Poster | Wed 14:30 | Universal Stagewise Learning for Non-Convex Problems with Convergence on Averaged Solutions | Zaiyi Chen · Zhuoning Yuan · Jinfeng Yi · Bowen Zhou · Enhong Chen · Tianbao Yang
Poster | Wed 14:30 | Deterministic PAC-Bayesian generalization bounds for deep networks via generalizing noise-resilience | Vaishnavh Nagarajan · Zico Kolter
Poster | Wed 14:30 | SGD Converges to Global Minimum in Deep Learning via Star-convex Path | Yi Zhou · Junjie Yang · Huishuai Zhang · Yingbin Liang · Vahid Tarokh
Poster | Wed 14:30 | On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length | Stanislaw Jastrzebski · Zachary Kenton · Nicolas Ballas · Asja Fischer · Yoshua Bengio · Amos Storkey
Poster | Wed 14:30 | Preconditioner on Matrix Lie Group for SGD | Xi-Lin Li
Poster | Wed 14:30 | Local SGD Converges Fast and Communicates Little | Sebastian Stich
Poster | Wed 14:30 | Quasi-hyperbolic momentum and Adam for deep learning | Jerry Ma · Denis Yarats