firstbacksecondback
317 Results
Poster
|
Mon 7:30 |
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning Haichao Zhang · Wei Xu · Haonan Yu |
|
Oral
|
Tue 7:10 |
Extreme Q-Learning: MaxEnt RL without Entropy Divyansh Garg · Joey Hejna · Matthieu Geist · Stefano Ermon |
|
Poster
|
Tue 7:30 |
Extreme Q-Learning: MaxEnt RL without Entropy Divyansh Garg · Joey Hejna · Matthieu Geist · Stefano Ermon |
|
Poster
|
In-sample Actor Critic for Offline Reinforcement Learning Hongchang Zhang · Yixiu Mao · Boyuan Wang · Shuncheng He · Yi Xu · Xiangyang Ji |
||
Workshop
|
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories Qinqing Zheng · Mikael Henaff · Brandon Amos · Aditya Grover |
||
Poster
|
Wed 2:30 |
Explaining RL Decisions with Trajectories Shripad Deshmukh · Arpan Dasgupta · Balaji Krishnamurthy · Nan Jiang · Chirag Agarwal · Georgios Theocharous · Jayakumar Subramanian |
|
Oral
|
Wed 6:10 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Poster
|
Wed 7:30 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Poster
|
Tue 7:30 |
Confidence-Conditioned Value Functions for Offline Reinforcement Learning Joey Hong · Aviral Kumar · Sergey Levine |
|
Oral
|
Tue 7:00 |
Confidence-Conditioned Value Functions for Offline Reinforcement Learning Joey Hong · Aviral Kumar · Sergey Levine |
|
Oral
|
Mon 1:50 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Mon 2:30 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |