firstbacksecondback
316 Results
Poster
|
Tue 2:30 |
Imitating Human Behaviour with Diffusion Models Tim Pearce · Tabish Rashid · Anssi Kanervisto · David Bignell · Mingfei Sun · Raluca Georgescu · Sergio Valcarcel Macua · Shan Zheng Tan · Ida Momennejad · Katja Hofmann · Sam Devlin |
|
Poster
|
Wed 7:30 |
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Russ Salakhutdinov |
|
Poster
|
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm Toygun Basaklar · Suat Gumussoy · Umit Ogras |
||
Poster
|
Memory Gym: Partially Observable Challenges to Memory-Based Agents Marco Pleines · Matthias Pallasch · Frank Zimmer · Mike Preuss |
||
Poster
|
Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies Rui Yuan · Simon Du · Robert M. Gower · Alessandro Lazaric · Lin Xiao |
||
Poster
|
Tue 7:30 |
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games Samuel Sokota · Ryan D'Orazio · Zico Kolter · Nicolas Loizou · Marc Lanctot · Ioannis Mitliagkas · Noam Brown · Christian Kroer |
|
Workshop
|
Thu 4:00 |
Principled Reinforcement Learning with Human Feedback from Pairwise or -wise Comparisons Banghua Zhu · Jiantao Jiao · Michael Jordan |
|
Poster
|
Deep Reinforcement Learning for Cost-Effective Medical Diagnosis Zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Poster
|
Tue 7:30 |
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktaeschel |
|
Poster
|
Achieving Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits Xuchuang Wang · Lin Yang · Yu-Zhen Janice Chen · Xutong Liu · Mohammad Hajiesmaili · Don Towsley · John C.S. Lui |
||
Poster
|
Tue 2:30 |
Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Dianbo Liu · Vedant Shah · Oussama Boussif · Cristian Meo · Anirudh Goyal · Tianmin Shu · Michael Mozer · Nicolas Heess · Yoshua Bengio |
|
Poster
|
Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu · Yuchen Lu · Yikang Shen · Shun Zhang · DING ZHAO · Chuang Gan |