237 Results
Spotlight | Wed 10:30 | DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine
Poster | Wed 10:30 | DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Jinxin Liu · Hongyin Zhang · Donglin Wang
Spotlight | Mon 18:30 | Generalized Decision Transformer for Offline Hindsight Information Matching | Hiroki Furuta · Yutaka Matsuo · Shixiang Gu
Poster | Mon 10:30 | Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Aviral Kumar · Joey Hong · Anikait Singh · Sergey Levine
Poster | Tue 10:30 | Learning Value Functions from Undirected State-only Experience | Matthew Chang · Arjun Gupta · Saurabh Gupta
Poster | Wed 10:30 | DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine
Poster | Mon 18:30 | Generalized Decision Transformer for Offline Hindsight Information Matching | Hiroki Furuta · Yutaka Matsuo · Shixiang Gu
Poster | Mon 10:30 | Offline Reinforcement Learning with Implicit Q-Learning | Ilya Kostrikov · Ashvin Nair · Sergey Levine
Poster | Wed 10:30 | RvS: What is Essential for Offline RL via Supervised Learning? | Scott Emmons · Benjamin Eysenbach · Ilya Kostrikov · Sergey Levine
Poster | Tue 2:30 | GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems | Youngsoo Jang · Jongmin Lee · Kee-Eung Kim
Spotlight | Mon 18:30 | Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration | Desik Rengarajan · Gargi Vaidya · Akshay Sarvesh · Dileep Kalathil · Srinivas Shakkottai
Poster | Mon 18:30 | Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration | Desik Rengarajan · Gargi Vaidya · Akshay Sarvesh · Dileep Kalathil · Srinivas Shakkottai