firstbacksecondback
155 Results
Poster
|
Mon 17:00 |
Regularized Inverse Reinforcement Learning Wonseok Jeon · Chen-Yang Su · Paul Barde · Thang Doan · Derek Nowrouzezahrai · Joelle Pineau |
|
Poster
|
Wed 9:00 |
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim |
|
Poster
|
Wed 9:00 |
Human-Level Performance in No-Press Diplomacy via Equilibrium Search Jonathan Gray · Adam Lerer · Anton Bakhtin · Noam Brown |
|
Oral
|
Tue 19:00 |
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim |
|
Oral
|
Wed 11:00 |
Human-Level Performance in No-Press Diplomacy via Equilibrium Search Jonathan Gray · Adam Lerer · Anton Bakhtin · Noam Brown |
|
Poster
|
Tue 17:00 |
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Michael Zhang · Thomas Paine · Ofir Nachum · Cosmin Paduraru · George Tucker · ziyu wang · Mohammad Norouzi |
|
Spotlight
|
Thu 4:45 |
Self-supervised Visual Reinforcement Learning with Object-centric Representations Andrii Zadaianchuk · Maximilian Seitzer · Georg Martius |
|
Poster
|
Thu 1:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Oral
|
Thu 3:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Poster
|
Wed 9:00 |
Self-supervised Visual Reinforcement Learning with Object-centric Representations Andrii Zadaianchuk · Maximilian Seitzer · Georg Martius |
|
Poster
|
Mon 9:00 |
Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers Cristina Pinneri · Shambhuraj Sawant · Sebastian Blaes · Georg Martius |