firstbacksecondback
4 Results
Poster
|
Wed 14:30 |
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy Yuan Xie · Boyi Liu · Qiang Liu · Zhaoran Wang · Yuan Zhou · Jian Peng |
|
Poster
|
Wed 9:00 |
Reward Constrained Policy Optimization Chen Tessler · Daniel J Mankowitz · Shie Mannor |
|
Poster
|
Wed 9:00 |
Policy Transfer with Strategy Optimization Wenhao Yu · C. Liu · Greg Turk |
|
Poster
|
Thu 14:30 |
Bayesian Policy Optimization for Model Uncertainty Gilwoo Lee · Brian Hou · Aditya Mandalika · Jeongseok Lee · Sanjiban Choudhury · Siddhartha Srinivasa |