28 Results
Poster | | Behavior Proximal Policy Optimization | Zifeng Zhuang · Kun LEI · Jinxin Liu · Donglin Wang · Yilang Guo
Poster | Mon 7:30 | Near-optimal Policy Identification in Active Reinforcement Learning | Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic
Poster | Wed 7:30 | Efficiently Computing Nash Equilibria in Adversarial Team Markov Games | Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis
Poster | | Memory Gym: Partially Observable Challenges to Memory-Based Agents | Marco Pleines · Matthias Pallasch · Frank Zimmer · Mike Preuss
Oral | Mon 6:50 | Near-optimal Policy Identification in Active Reinforcement Learning | Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic
Oral | Wed 6:10 | Efficiently Computing Nash Equilibria in Adversarial Team Markov Games | Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis
Poster | Wed 2:30 | An Adaptive Policy to Employ Sharpness-Aware Minimization | Weisen JIANG · Hansi Yang · Yu Zhang · James Kwok
Poster | | Efficient Offline Policy Optimization with a Learned Model | Zichen Liu · Siyi Li · Wee Sun Lee · shuicheng YAN · Zhongwen Xu
Poster | Mon 7:30 | Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Rajkumar Ramamurthy · Prithviraj Ammanabrolu · Kianté Brantley · Jack Hessel · Rafet Sifa · Christian Bauckhage · Hannaneh Hajishirzi · Yejin Choi
Poster | | Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems | Zhongyuan Zhao · Ananthram Swami · Santiago Segarra
Poster | Wed 7:30 | Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences | Alan Chan · Hugo Silva · Sungsu Lim · Tadashi Kozuno · A. Rupam Mahmood · Martha White
Poster | Tue 7:30 | Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization | Jihwan Jeong · Xiaoyu Wang · Michael Gimelfarb · Hyunwoo Kim · Baher Abdulhai · Scott Sanner