firstbacksecondback
334 Results
Workshop
|
Thu 4:00 |
Principled Reinforcement Learning with Human Feedback from Pairwise or -wise Comparisons Banghua Zhu · Jiantao Jiao · Michael Jordan |
|
Poster
|
Deep Reinforcement Learning for Cost-Effective Medical Diagnosis Zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Poster
|
Achieving Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits Xuchuang Wang · Lin Yang · Yu-Zhen Janice Chen · Xutong Liu · Mohammad Hajiesmaili · Don Towsley · John C.S. Lui |
||
Poster
|
Tue 2:30 |
Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Dianbo Liu · Vedant Shah · Oussama Boussif · Cristian Meo · Anirudh Goyal · Tianmin Shu · Michael Mozer · Nicolas Heess · Yoshua Bengio |
|
Poster
|
Tue 7:30 |
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktaeschel |
|
Poster
|
Hyper-Decision Transformer for Efficient Online Policy Adaptation Mengdi Xu · Yuchen Lu · Yikang Shen · Shun Zhang · DING ZHAO · Chuang Gan |
||
Poster
|
Mon 2:30 |
Interaction-Based Disentanglement of Entities for Object-Centric World Models Akihiro Nakano · Masahiro Suzuki · Yutaka Matsuo |
|
Poster
|
Mon 2:30 |
Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots Wei Hung · Bo Kai Huang · Ping-Chun Hsieh · Xi Liu |
|
Oral
|
Tue 6:30 |
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware Nico Gürtler · Sebastian Blaes · Pavel Kolev · Felix Widmaier · Manuel Wuthrich · Stefan Bauer · Bernhard Schoelkopf · Georg Martius |
|
Poster
|
Tue 7:30 |
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware Nico Gürtler · Sebastian Blaes · Pavel Kolev · Felix Widmaier · Manuel Wuthrich · Stefan Bauer · Bernhard Schoelkopf · Georg Martius |