firstbacksecondback
43 Results
Poster
|
Wed 17:00 |
Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System Jianhong Wang · Yuan Zhang · Tae-Kyun Kim · Yunjie Gu |
|
Poster
|
Thu 1:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Oral
|
Thu 3:00 |
What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study Marcin Andrychowicz · Anton Raichuk · Piotr Stanczyk · Manu Orsini · Sertan Girgin · Raphaël Marinier · Léonard Hussenot-Desenonges · Matthieu Geist · Olivier Pietquin · Marcin Michalski · Sylvain Gelly · Olivier Bachem |
|
Poster
|
Wed 9:00 |
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim |
|
Poster
|
Wed 1:00 |
Learning Task Decomposition with Ordered Memory Policy Network Yuchen Lu · Yikang Shen · Siyuan Zhou · Aaron Courville · Joshua B Tenenbaum · Chuang Gan |
|
Oral
|
Tue 19:00 |
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients Brenden Petersen · Mikel Landajuela Larma · Terrell N Mundhenk · Claudio Santiago · Soo Kim · Joanne Kim |
|
Poster
|
Mon 9:00 |
Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers Cristina Pinneri · Shambhuraj Sawant · Sebastian Blaes · Georg Martius |