firstbacksecondback
10 Results
Poster
|
Thu 17:00 |
Learning to Sample with Local and Global Contexts in Experience Replay Buffer Youngmin Oh · Kimin Lee · Jinwoo Shin · Eunho Yang · Sung Ju Hwang |
|
Poster
|
Thu 17:00 |
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds Yihao Feng · Ziyang Tang · Na Zhang · Qiang Liu |
|
Poster
|
Tue 17:00 |
DOP: Off-Policy Multi-Agent Decomposed Policy Gradients Yihan Wang · Beining Han · Tonghan Wang · Heng Dong · Chongjie Zhang |
|
Poster
|
Thu 1:00 |
Representation Balancing Offline Model-based Reinforcement Learning Byung-Jun Lee · Jongmin Lee · Kee-Eung Kim |
|
Poster
|
Mon 1:00 |
Parameter-Based Value Functions Francesco Faccio · Louis Kirsch · Jürgen Schmidhuber |
|
Poster
|
Wed 9:00 |
Benchmarks for Deep Off-Policy Evaluation Justin Fu · Mohammad Norouzi · Ofir Nachum · George Tucker · ziyu wang · Alexander Novikov · Sherry Yang · Michael Zhang · Yutian Chen · Aviral Kumar · Cosmin Paduraru · Sergey Levine · Thomas Paine |
|
Poster
|
Tue 17:00 |
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Michael Zhang · Thomas Paine · Ofir Nachum · Cosmin Paduraru · George Tucker · ziyu wang · Mohammad Norouzi |
|
Spotlight
|
Thu 3:25 |
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers Siyi Hu · Fengda Zhu · Xiaojun Chang · Xiaodan Liang |
|
Poster
|
Tue 9:00 |
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics Yanchao Sun · Da Huo · Furong Huang |
|
Poster
|
Mon 17:00 |
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers Siyi Hu · Fengda Zhu · Xiaojun Chang · Xiaodan Liang |