firstbacksecondback
25 Results
Poster
|
Wed 2:30 |
Hybrid RL: Using both offline and online data can make RL efficient Yuda Song · Yifei Zhou · Ayush Sekhari · Drew Bagnell · Akshay Krishnamurthy · Wen Sun |
|
Oral
|
Tue 6:40 |
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization Haoran Xu · Li Jiang · Jianxiong Li · Zhuoran Yang · Zhaoran Wang · Wai Chan · Xianyuan Zhan |
|
Poster
|
Tue 7:30 |
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization Haoran Xu · Li Jiang · Jianxiong Li · Zhuoran Yang · Zhaoran Wang · Wai Chan · Xianyuan Zhan |
|
Poster
|
Tue 7:30 |
Extreme Q-Learning: MaxEnt RL without Entropy Divyansh Garg · Joey Hejna · Matthieu Geist · Stefano Ermon |
|
Poster
|
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian Paria Rashidinejad · Hanlin Zhu · Kunhe Yang · Stuart Russell · Jiantao Jiao |
||
Poster
|
Mon 2:30 |
Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL Baiting Zhu · Meihua Dang · Aditya Grover |
|
Oral
|
Mon 1:30 |
Does Zero-Shot Reinforcement Learning Exist? Ahmed Touati · Jérémy Rapin · Yann Ollivier |
|
Oral
|
Mon 1:50 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Mon 2:30 |
Does Zero-Shot Reinforcement Learning Exist? Ahmed Touati · Jérémy Rapin · Yann Ollivier |
|
Poster
|
Mon 2:30 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Wed 7:30 |
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning Zhendong Wang · Jonathan J Hunt · Mingyuan Zhou |
|
Poster
|
Mon 2:30 |
VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training Yecheng Jason Ma · Shagun Sodhani · Dinesh Jayaraman · Osbert Bastani · Vikash Kumar · Amy Zhang |