Poster
|
Tue 17:00
|
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Zhang · Thomas Paine · Ofir Nachum · Cosmin Paduraru · George Tucker · ziyu wang · Mohammad Norouzi
|
|
Workshop
|
Fri 14:10
|
Improving Exploration in Policy Gradient Search: Application to Symbolic Optimization
|
|
Poster
|
Tue 9:00
|
Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
Andrea Agazzi · Jianfeng Lu
|
|
Workshop
|
|
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu · Aviral Kumar · Aravind Rajeswaran · Rafael Rafailov · Sergey Levine · Chelsea Finn
|
|
Poster
|
Thu 9:00
|
Enforcing robust control guarantees within neural network policies
Priya Donti · Melrose Roderick · Mahyar Fazlyab · Zico Kolter
|
|
Poster
|
Tue 9:00
|
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu · Zhuoran Yang · Zhaoran Wang
|
|
Spotlight
|
Wed 21:25
|
Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control
Zhuang Liu · Xuanlin Li · Bingyi Kang · trevor darrell
|
|
Poster
|
Mon 17:00
|
Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control
Zhuang Liu · Xuanlin Li · Bingyi Kang · trevor darrell
|
|
Poster
|
Tue 17:00
|
The Importance of Pessimism in Fixed-Dataset Policy Optimization
Jacob Buckman · Carles Gelada · Marc G Bellemare
|
|
Workshop
|
|
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee · Wonseok Jeon · Byung-Jun Lee · Joelle Pineau · Kee-Eung Kim
|
|
Poster
|
Wed 17:00
|
Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System
Jianhong Wang · Yuan Zhang · Tae-Kyun Kim · Yunjie Gu
|
|
Poster
|
Mon 9:00
|
Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers
Cristina Pinneri · Shambhuraj Sawant · Sebastian Blaes · Georg Martius
|
|