firstbacksecondback
154 Results
Poster
|
Mon 9:00 |
Self-Supervised Policy Adaptation during Deployment Nicklas Hansen · Rishabh Jangir · Yu Sun · Guillem Alenyà · Pieter Abbeel · Alexei Efros · Lerrel Pinto · Xiaolong Wang |
|
Poster
|
Wed 1:00 |
Acting in Delayed Environments with Non-Stationary Markov Policies Esther Derman · Gal Dalal · Shie Mannor |
|
Poster
|
Tue 17:00 |
The Importance of Pessimism in Fixed-Dataset Policy Optimization Jacob Buckman · Carles Gelada · Marc G Bellemare |
|
Spotlight
|
Thu 3:25 |
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers Siyi Hu · Fengda Zhu · Xiaojun Chang · Xiaodan Liang |
|
Spotlight
|
Wed 21:25 |
Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control Zhuang Liu · Xuanlin Li · Bingyi Kang · trevor darrell |
|
Poster
|
Mon 17:00 |
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers Siyi Hu · Fengda Zhu · Xiaojun Chang · Xiaodan Liang |
|
Workshop
|
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse TD Learning Angelos Filos · Clare Lyle · Yarin Gal · Sergey Levine · Natasha Jaques · Gregory Farquhar |
||
Poster
|
Mon 1:00 |
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model Xinyue Chen · Che Wang · Zijian Zhou · Keith Ross |
|
Poster
|
Mon 1:00 |
QPLEX: Duplex Dueling Multi-Agent Q-Learning Jianhao Wang · Zhizhou Ren · Terry Liu · Yang Yu · Chongjie Zhang |
|
Poster
|
Tue 9:00 |
C-Learning: Horizon-Aware Cumulative Accessibility Estimation Panteha Naderian · Gabriel Loaiza-Ganem · Harry Braviner · Anthony Caterini · Jesse C Cresswell · Tong Li · Animesh Garg |
|
Workshop
|
Towards Reinforcement Learning in the Continuing Setting Abhishek Naik · Zaheer Abbas · Adam White · Richard Sutton |
||
Poster
|
Thu 9:00 |
Enforcing robust control guarantees within neural network policies Priya Donti · Melrose Roderick · Mahyar Fazlyab · Zico Kolter |