237 Results
| Type | Time | Title | Authors |
|---|---|---|---|
| Poster | | Near-Optimal Adversarial Reinforcement Learning with Switching Costs | Ming Shi · Yingbin Liang · Ness Shroff |
| Poster | | A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning | Zixiang Chen · Chris Junchi Li · Angela Yuan · Quanquan Gu · Michael Jordan |
| Poster | Wed 7:30 | Towards convergence to Nash equilibria in two-team zero-sum games | Fivos Kalogiannis · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis |
| Poster | | Exponential Generalization Bounds with Near-Optimal Rates for Lq-Stable Algorithms | Xiaotong Yuan · Ping Li |
| Poster | Wed 2:30 | On The Inadequacy of Optimizing Alignment and Uniformity in Contrastive Learning of Sentence Representations | Zhijie Nie · Richong Zhang · Yongyi Mao |
| Poster | | Differentially Private Adaptive Optimization with Delayed Preconditioners | Tian Li · Manzil Zaheer · Ken Liu · Sashank Reddi · H. Brendan McMahan · Virginia Smith |
| Poster | Tue 7:30 | Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Samuel Neumann · Sungsu Lim · Ajin Joseph · Yangchen Pan · Adam White · Martha White |
| Oral | Wed 1:50 | Multi-Objective Online Learning | Jiyan Jiang · Wenpeng Zhang · Shiji Zhou · Lihong Gu · Xiaodong Zeng · Wenwu Zhu |
| Poster | Tue 2:30 | Adaptive Optimization in the ∞-Width Limit | Etai Littwin · Greg Yang |
| Poster | Wed 2:30 | Multi-Objective Online Learning | Jiyan Jiang · Wenpeng Zhang · Shiji Zhou · Lihong Gu · Xiaodong Zeng · Wenwu Zhu |
| Poster | Wed 7:30 | Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences | Alan Chan · Hugo Silva · Sungsu Lim · Tadashi Kozuno · A. Rupam Mahmood · Martha White |
| Poster | | Efficiently Controlling Multiple Risks with Pareto Testing | Bracha Laufer-Goldshtein · Adam Fisch · Regina Barzilay · Tommi Jaakkola |