757 Results
Oral
|
Mon 1:50 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Wed 7:30 |
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning Zhendong Wang · Jonathan J Hunt · Mingyuan Zhou |
|
Poster
|
Wed 7:30 |
Reward Design with Language Models Minae Kwon · Sang Michael Xie · Kalesha Bullard · Dorsa Sadigh |
|
Oral
|
Tue 6:00 |
Transformers are Sample-Efficient World Models Vincent Micheli · Eloi Alonso · François Fleuret |
|
Poster
|
Tue 7:30 |
Transformers are Sample-Efficient World Models Vincent Micheli · Eloi Alonso · François Fleuret |
|
Oral
|
Tue 1:30 |
On the Sensitivity of Reward Inference to Misspecified Human Models Joey Hong · Kush Bhatia · Anca Dragan |
|
Poster
|
Tue 2:30 |
On the Sensitivity of Reward Inference to Misspecified Human Models Joey Hong · Kush Bhatia · Anca Dragan |
|
Workshop
|
Model-Based Adversarial Imitation Learning As Online Fine-Tuning Rafael Rafailov · Victor Kolev · Kyle Hatch · John Martin · Mariano Phielipp · Jiajun Wu · Chelsea Finn |
|
Oral
|
Tue 6:00 |
Planning Goals for Exploration Edward Hu · Richard Chang · Oleh Rybkin · Dinesh Jayaraman |
|
Poster
|
Tue 7:30 |
Planning Goals for Exploration Edward Hu · Richard Chang · Oleh Rybkin · Dinesh Jayaraman |
|
Poster
|
Tue 2:30 |
LDMIC: Learning-based Distributed Multi-view Image Coding Xinjie Zhang · Jiawei Shao · Jun Zhang |
|
Poster
|
Tue 7:30 |
Choreographer: Learning and Adapting Skills in Imagination Pietro Mazzaglia · Tim Verbelen · Bart Dhoedt · Alexandre Lacoste · Sai Rajeswar |