firstbacksecondback
334 Results
Poster
|
Mon 2:30 |
Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model Zhihai Wang · Xijun Li · Jie Wang · Yufei Kuang · Mingxuan Yuan · Jia Zeng · Yongdong Zhang · Feng Wu |
|
Poster
|
Mon 2:30 |
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation Yannick Hogewind · Thiago D. Simão · Tal Kachman · Nils Jansen |
|
Poster
|
Making Better Decision by Directly Planning in Continuous Control Jinhua Zhu · Yue Wang · Lijun Wu · Tao Qin · Wengang Zhou · Tie-Yan Liu · Houqiang Li |
||
Oral
|
Mon 6:30 |
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc G Bellemare · Aaron Courville |
|
Poster
|
Mon 7:30 |
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc G Bellemare · Aaron Courville |
|
Poster
|
Tue 7:30 |
Population-size-Aware Policy Optimization for Mean-Field Games Pengdeng Li · Xinrun Wang · Shuxin Li · Hau Chan · Bo An |
|
Poster
|
Tue 2:30 |
SMART: Self-supervised Multi-task pretrAining with contRol Transformers Yanchao Sun · shuang ma · Ratnesh Madaan · Rogerio Bonatti · Furong Huang · Ashish Kapoor |
|
Workshop
|
Thu 4:00 |
Aligning Foundation Models for Language with Preferences through f-divergence Minimization Dongyoung Go · Tomek Korbak · Germàn Kruszewski · Jos Rozen · Nahyeon Ryu · Marc Dymetman |
|
Poster
|
Mon 7:30 |
Near-optimal Policy Identification in Active Reinforcement Learning Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic |
|
Poster
|
Mon 2:30 |
Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL Baiting Zhu · Meihua Dang · Aditya Grover |
|
Oral
|
Mon 6:50 |
Near-optimal Policy Identification in Active Reinforcement Learning Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic |
|
Poster
|
O(T−1) Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games Yuepeng Yang · Cong Ma |