firstbacksecondback
151 Results
Oral
|
Wed 11:00 |
Human-Level Performance in No-Press Diplomacy via Equilibrium Search Jonathan Gray · Adam Lerer · Anton Bakhtin · Noam Brown |
|
Poster
|
Tue 17:00 |
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Michael Zhang · Thomas Paine · Ofir Nachum · Cosmin Paduraru · George Tucker · ziyu wang · Mohammad Norouzi |
|
Spotlight
|
Thu 4:45 |
Self-supervised Visual Reinforcement Learning with Object-centric Representations Andrii Zadaianchuk · Maximilian Seitzer · Georg Martius |
|
Oral
|
Thu 3:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Poster
|
Thu 1:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Poster
|
Wed 9:00 |
Self-supervised Visual Reinforcement Learning with Object-centric Representations Andrii Zadaianchuk · Maximilian Seitzer · Georg Martius |
|
Poster
|
Mon 9:00 |
Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers Cristina Pinneri · Shambhuraj Sawant · Sebastian Blaes · Georg Martius |