89 Results
Poster | Tue 9:00 | Value Propagation Networks | Nantas Nardelli · Gabriel Synnaeve · Zeming Lin · Pushmeet Kohli · Philip Torr · Nicolas Usunier
Poster | Wed 9:00 | Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search | Lars Buesing · Theophane Weber · Yori Zwols · Nicolas Heess · Sebastien Racaniere · Arthur Guez · Jean-Baptiste Lespiau
Poster | Wed 9:00 | Preferences Implicit in the State of the World | Rohin Shah · Dmitrii Krasheninnikov · Jordan Alexander · Pieter Abbeel · Anca Dragan
Poster | Tue 9:00 | Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic | Mikael Henaff · Alfredo Canziani · Yann LeCun
Poster | Wed 9:00 | Information asymmetry in KL-regularized RL | Alexandre Galashov · Siddhant Jayakumar · Leonard Hasenclever · Dhruva Tirumala · Jonathan Schwarz · Guillaume Desjardins · Wojciech M Czarnecki · Yee Whye Teh · Razvan Pascanu · Nicolas Heess
Poster | Thu 14:30 | Attention, Learn to Solve Routing Problems! | Wouter Kool · Herke van Hoof · Max Welling
Poster | Thu 9:00 | From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following | Justin Fu · Anoop Korattikara Balan · Sergey Levine · Sergio Guadarrama
Poster | Wed 9:00 | M^3RL: Mind-aware Multi-agent Management Reinforcement Learning | Tianmin Shu · Yuandong Tian
Poster | Wed 9:00 | A new dog learns old tricks: RL finds classic optimization algorithms | Weiwei Kong · Christopher Liaw · Aranyak Mehta · D. Sivakumar
Poster | Tue 9:00 | Recall Traces: Backtracking Models for Efficient Reinforcement Learning | Anirudh Goyal · Philemon Brakel · William Fedus · Soumye Singhal · Timothy Lillicrap · Sergey Levine · Hugo Larochelle · Yoshua Bengio
Poster | Wed 9:00 | CEM-RL: Combining evolutionary and gradient-based methods for policy search | Aloïs Pourchot · Olivier Sigaud
Poster | Wed 9:00 | Exploration by random network distillation | Yuri Burda · Harrison Edwards · Amos Storkey · Oleg Klimov