Processing math: 100%
Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

334 Results

<<   <   Page 26 of 28   >   >>
Poster
Mon 2:30 Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model
Zhihai Wang · Xijun Li · Jie Wang · Yufei Kuang · Mingxuan Yuan · Jia Zeng · Yongdong Zhang · Feng Wu
Poster
Mon 2:30 Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind · Thiago D. Simão · Tal Kachman · Nils Jansen
Poster
Making Better Decision by Directly Planning in Continuous Control
Jinhua Zhu · Yue Wang · Lijun Wu · Tao Qin · Wengang Zhou · Tie-Yan Liu · Houqiang Li
Oral
Mon 6:30 Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc G Bellemare · Aaron Courville
Poster
Mon 7:30 Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc G Bellemare · Aaron Courville
Poster
Tue 7:30 Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li · Xinrun Wang · Shuxin Li · Hau Chan · Bo An
Poster
Tue 2:30 SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun · shuang ma · Ratnesh Madaan · Rogerio Bonatti · Furong Huang · Ashish Kapoor
Workshop
Thu 4:00 Aligning Foundation Models for Language with Preferences through f-divergence Minimization
Dongyoung Go · Tomek Korbak · Germàn Kruszewski · Jos Rozen · Nahyeon Ryu · Marc Dymetman
Poster
Mon 7:30 Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic
Poster
Mon 2:30 Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL
Baiting Zhu · Meihua Dang · Aditya Grover
Oral
Mon 6:50 Near-optimal Policy Identification in Active Reinforcement Learning
Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic
Poster
O(T1) Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games
Yuepeng Yang · Cong Ma