firstbacksecondback
659 Results
Poster
|
Mon 7:30 |
Near-optimal Policy Identification in Active Reinforcement Learning Xiang Li · Viraj Mehta · Johannes Kirschner · Ian Char · Willie Neiswanger · Jeff Schneider · Andreas Krause · Ilija Bogunovic |
|
Poster
|
Making Better Decision by Directly Planning in Continuous Control Jinhua Zhu · Yue Wang · Lijun Wu · Tao Qin · Wengang Zhou · Tie-Yan Liu · Houqiang Li |
||
Poster
|
PEER: A Collaborative Language Model Timo Schick · Jane Dwivedi-Yu · Zhengbao Jiang · Fabio Petroni · Patrick Lewis · Gautier Izacard · Qingfei You · Christoforos Nalmpantis · Edouard Grave · Sebastian Riedel |
||
Poster
|
Achieving Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits Xuchuang Wang · Lin Yang · Yu-Zhen Janice Chen · Xutong Liu · Mohammad Hajiesmaili · Don Towsley · John C.S. Lui |
||
Poster
|
Wed 7:30 |
MA-BERT: Towards Matrix Arithmetic-only BERT Inference by Eliminating Complex Non-Linear Functions Wei Ming Neo · Zhehui Wang · Cheng Liu · Rick Goh · Tao Luo |
|
Poster
|
Wed 2:30 |
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions Ansong Ni · Jeevana Priya Inala · Chenglong Wang · Alex Polozov · Christopher Meek · Dragomir Radev · Jianfeng Gao |
|
Poster
|
FoSR: First-order spectral rewiring for addressing oversquashing in GNNs Kedar Karhadkar · Pradeep Banerjee · Guido Montufar |
||
Poster
|
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs Yuan Cheng · Ruiquan Huang · Yingbin Liang · Jing Yang |
||
Poster
|
Wed 7:30 |
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Russ Salakhutdinov |
|
Poster
|
Wed 7:30 |
Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation Marius-Constantin Dinu · Markus Holzleitner · Maximilian Beck · Hoan Nguyen · Andrea Huber · Hamid Eghbalzadeh · Bernhard A. Moser · Sergei Pereverzyev · Sepp Hochreiter · Werner Zellinger |
|
Oral
|
Wed 6:20 |
Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation Marius-Constantin Dinu · Markus Holzleitner · Maximilian Beck · Hoan Nguyen · Andrea Huber · Hamid Eghbalzadeh · Bernhard A. Moser · Sergei Pereverzyev · Sepp Hochreiter · Werner Zellinger |