334 Results
Type | Time | Title | Authors
Poster | | Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function | Ruijie Zheng · Xiyao Wang · Huazhe Xu · Furong Huang
Poster | | Neural DAG Scheduling via One-Shot Priority Sampling | Wonseok Jeon · Mukul Gagrani · Burak Bartan · Weiliang Zeng · Harris Teague · Piero Zappi · Christopher Lott
Oral | Tue 1:50 | SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Yanchao Sun · shuang ma · Ratnesh Madaan · Rogerio Bonatti · Furong Huang · Ashish Kapoor
Poster | | Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data | Fuxiang Zhang · Chengxing Jia · Yi-Chen Li · Lei Yuan · Yang Yu · Zongzhang Zhang
Poster | Wed 7:30 | Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective | Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Russ Salakhutdinov
Poster | Tue 2:30 | Imitating Human Behaviour with Diffusion Models | Tim Pearce · Tabish Rashid · Anssi Kanervisto · David Bignell · Mingfei Sun · Raluca Georgescu · Sergio Valcarcel Macua · Shan Zheng Tan · Ida Momennejad · Katja Hofmann · Sam Devlin
Poster | | Memory Gym: Partially Observable Challenges to Memory-Based Agents | Marco Pleines · Matthias Pallasch · Frank Zimmer · Mike Preuss
Poster | | PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm | Toygun Basaklar · Suat Gumussoy · Umit Ogras
Poster | | Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies | Rui Yuan · Simon Du · Robert M. Gower · Alessandro Lazaric · Lin Xiao
Poster | Tue 7:30 | A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games | Samuel Sokota · Ryan D'Orazio · Zico Kolter · Nicolas Loizou · Marc Lanctot · Ioannis Mitliagkas · Noam Brown · Christian Kroer
Oral | Tue 6:30 | Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts | Zhili LIU · Kai Chen · Jianhua Han · Lanqing HONG · Hang Xu · Zhenguo Li · James Kwok
Poster | Tue 7:30 | Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts | Zhili LIU · Kai Chen · Jianhua Han · Lanqing HONG · Hang Xu · Zhenguo Li · James Kwok