firstbacksecondback
8 Results
Poster
|
Wed 7:30 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Oral
|
Wed 6:10 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Workshop
|
Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning Md Masudur Rahman · Yexiang Xue |
||
Poster
|
Improving Deep Policy Gradients with Value Function Search Enrico Marchesini · Christopher Amato |
||
Poster
|
Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems Zhongyuan Zhao · Ananthram Swami · Santiago Segarra |
||
Poster
|
Tue 7:30 |
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement Samuel Neumann · Sungsu Lim · Ajin Joseph · Yangchen Pan · Adam White · Martha White |
|
Poster
|
Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies Rui Yuan · Simon Du · Robert M. Gower · Alessandro Lazaric · Lin Xiao |
||
Poster
|
Mon 7:30 |
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning Pan Lu · Liang Qiu · Kai-Wei Chang · Yingnian Wu · Song-Chun Zhu · Tanmay Rajpurohit · Peter Clark · Ashwin Kalyan |