firstbacksecondback
3 Results
Poster
|
Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games Yuepeng Yang · Cong Ma |
||
Poster
|
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games Shicong Cen · Yuejie Chi · Simon Du · Lin Xiao |
||
Poster
|
Wed 7:30 |
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning Zhendong Wang · Jonathan J Hunt · Mingyuan Zhou |