152 Results
Poster | Thu 9:00 | Model-Based Offline Planning | Arthur Argenson · Gabriel Dulac-Arnold
Workshop | | PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse TD Learning | Angelos Filos · Clare Lyle · Yarin Gal · Sergey Levine · Natasha Jaques · Gregory Farquhar
Poster | Thu 1:00 | AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights | Byeongho Heo · Sanghyuk Chun · Seong Joon Oh · Dongyoon Han · Sangdoo Yun · Gyuwan Kim · Youngjung Uh · Jung-Woo Ha
Poster | Mon 1:00 | Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Xinyue Chen · Che Wang · Zijian Zhou · Keith Ross
Poster | Mon 1:00 | QPLEX: Duplex Dueling Multi-Agent Q-Learning | Jianhao Wang · Zhizhou Ren · Terry Liu · Yang Yu · Chongjie Zhang
Poster | Tue 9:00 | C-Learning: Horizon-Aware Cumulative Accessibility Estimation | Panteha Naderian · Gabriel Loaiza-Ganem · Harry Braviner · Anthony Caterini · Jesse C Cresswell · Tong Li · Animesh Garg
Workshop | | Towards Reinforcement Learning in the Continuing Setting | Abhishek Naik · Zaheer Abbas · Adam White · Richard Sutton
Poster | Thu 9:00 | C-Learning: Learning to Achieve Goals via Recursive Classification | Benjamin Eysenbach · Ruslan Salakhutdinov · Sergey Levine
Poster | Mon 17:00 | Variational Intrinsic Control Revisited | Taehwan Kwon
Poster | Mon 1:00 | Temporally-Extended ε-Greedy Exploration | Will Dabney · Georg Ostrovski · Andre Barreto
Workshop | | Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Clayton C Ashcraft
Poster | Tue 17:00 | Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning | Aviral Kumar · Rishabh Agarwal · Dibya Ghosh · Sergey Levine