Poster | Wed 7:30 | SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization | Hanseul Cho · Chulhee Yun
Poster | | Accelerated Single-Call Methods for Constrained Min-Max Optimization | Yang Cai · Weiqiang Zheng
Poster | Wed 7:30 | Solving stochastic weak Minty variational inequalities without increasing batch size | Thomas Pethick · Olivier Fercoq · Puya Latafat · Panagiotis Patrinos · Volkan Cevher
Oral | Wed 6:50 | Depth Separation with Multilayer Mean-Field Networks | Yunwei Ren · Mo Zhou · Rong Ge
Poster | Tue 7:30 | TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization | Xiang Li · Junchi YANG · Niao He
Poster | Mon 2:30 | Understanding Edge-of-Stability Training Dynamics with a Minimalist Example | Xingyu Zhu · Zixuan Wang · Xiang Wang · Mo Zhou · Rong Ge
Poster | Wed 7:30 | Depth Separation with Multilayer Mean-Field Networks | Yunwei Ren · Mo Zhou · Rong Ge
Poster | | Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization | Difan Zou · Yuan Cao · Yuanzhi Li · Quanquan Gu
Poster | | Behind the Scenes of Gradient Descent: A Trajectory Analysis via Basis Function Decomposition | Jianhao Ma · Lingjun Guo · Salar Fattahi
Oral | Wed 1:00 | DASHA: Distributed Nonconvex Optimization with Communication Compression and Optimal Oracle Complexity | Alexander Tyurin · Peter Richtarik
Poster | Wed 2:30 | DASHA: Distributed Nonconvex Optimization with Communication Compression and Optimal Oracle Complexity | Alexander Tyurin · Peter Richtarik
Poster | | Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property | Yingzhen Yang · Ping Li