16 Results
Poster
|
On the Convergence of AdaGrad(Norm) on $\mathbb{R}^d$: Beyond Convexity, Non-Asymptotic Rate and Acceleration Zijian Liu · Ta Duy Nguyen · Alina Ene · Huy Nguyen |
||
Poster
|
Tue 2:30 |
Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, But Sign Descent Might Be Frederik Kunstner · Jacques Chen · Jonathan Lavington · Mark Schmidt |
|
Poster
|
Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction Wenlin Chen · Austin Tripp · José Miguel Hernández Lobato |
||
Poster
|
Mon 2:30 |
Distributed Extra-gradient with Optimal Complexity and Communication Guarantees Ali Ramezani-Kebrya · Kimon Antonakopoulos · Igor Krawczuk · Justin Deschenaux · Volkan Cevher |