firstbacksecondback
40 Results
Poster
|
Wed 7:30 |
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation Haruka Kiyohara · Ren Kishimoto · Kosuke Kawakami · Ken Kobayashi · Kazuhide Nakata · Yuta Saito |
|
Workshop
|
Prompt Optimization with Logged Bandit Data Haruka Kiyohara · Yuta Saito · Daniel Cao · Thorsten Joachims |
||
Poster
|
Tue 7:30 |
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies Haanvid Lee · Tri Wahyu Guntara · Jongmin Lee · Yung-Kyun Noh · Kee-Eung Kim |
|
Poster
|
Fri 1:45 |
On Trajectory Augmentations for Off-Policy Evaluation Ge Gao · Qitong Gao · Xi Yang · Song Ju · Miroslav Pajic · Min Chi |
|
Poster
|
Thu 7:30 |
Off-Policy Primal-Dual Safe Reinforcement Learning Zifan Wu · Bo Tang · Qian Lin · Chao Yu · Shangqin Mao · Qianlong Xie · Xingxing Wang · Dong Wang |
|
Poster
|
Tue 7:30 |
Replay across Experiments: A Natural Extension of Off-Policy RL Dhruva Tirumala · Thomas Lampe · Jose Enrique Chen · Tuomas Haarnoja · Sandy Huang · Guy Lever · Ben Moran · Tim Hertweck · Leonard Hasenclever · Martin Riedmiller · Nicolas Heess · Markus Wulfmeier |
|
Poster
|
Thu 7:30 |
Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment Siyao Li · Tianpei Gu · Zhitao Yang · Zhengyu Lin · Ziwei Liu · Henghui Ding · Lei Yang · Chen Change Loy |
|
Workshop
|
OMPO: A Unified Framework for Reinforcement Learning under Policy and Dynamics Shifts Yu Luo · Tianying Ji · Fuchun Sun · Jianwei Zhang · Huazhe Xu · Xianyuan Zhan |
||
Poster
|
Wed 7:30 |
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Zihan Ding · Chi Jin |
|
Poster
|
Tue 7:30 |
Time-Efficient Reinforcement Learning with Stochastic Stateful Policies Firas Al-Hafez · Guoping Zhao · Jan Peters · Davide Tateo |
|
Poster
|
Wed 7:30 |
Score Regularized Policy Optimization through Diffusion Behavior Huayu Chen · Cheng Lu · Zhengyi Wang · Hang Su · Jun Zhu |
|
Poster
|
Thu 1:45 |
Blending Imitation and Reinforcement Learning for Robust Policy Improvement Xuefeng Liu · Takuma Yoneda · Rick Stevens · Matthew Walter · Yuxin Chen |