23 Results
Poster
|
Mon 17:00 |
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics Zhiao Huang · Yuanming Hu · Tao Du · Siyuan Zhou · Hao Su · Joshua B Tenenbaum · Chuang Gan |
|
Poster
|
Tue 17:00 |
Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? Zhen Qin · Le Yan · Honglei Zhuang · Yi Tay · Rama Kumar Pasumarthi · Xuanhui Wang · Michael Bendersky · Marc Najork |
|
Poster
|
Wed 17:00 |
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving Yuhuai Wu · Albert Jiang · Jimmy Ba · Roger Grosse |
|
Poster
|
Thu 1:00 |
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning Ossama Ahmed · Frederik Träuble · Anirudh Goyal · Alexander Neitz · Manuel Wuthrich · Yoshua Bengio · Bernhard Schoelkopf · Stefan Bauer |
|
Workshop
|
Fri 7:45 |
Workshop on Enormous Language Models: Perspectives and Benchmarks Colin Raffel · Adam Roberts · Amanda Askell · Daphne Ippolito · Ethan Dyer · Guy Gur-Ari · Jared Kaplan · Jascha Sohl-Dickstein · Katherine Lee · Melanie Subbiah · Sam McCandlish · Tom Brown · William Fedus · Vedant Misra · Ambrose Slone · Daniel Freeman |
|
Workshop
|
FedGraphNN: A Federated Learning System and Benchmark for Graph Neural Networks Chaoyang He · Keshav Balasubramanian · Emir Ceyani · Yu Rong · Junzhou Huang · Murali Annavaram · Salman Avestimehr |
|
Workshop
|
Fri 15:11 |
Contributed Talk #2: RobustBench: a standardized adversarial robustness benchmark Francesco Croce · Vikash Sehwag · Prateek Mittal · Matthias Hein |
|
Workshop
|
Fri 14:01 |
Contributed Talk 3 - Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks Curtis G Northcutt |
|
Workshop
|
Fri 13:07 |
Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks Curtis G Northcutt |
|
Spotlight
|
Tue 20:40 |
Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? Zhen Qin · Le Yan · Honglei Zhuang · Yi Tay · Rama Kumar Pasumarthi · Xuanhui Wang · Michael Bendersky · Marc Najork |
|
Workshop
|
RobustBench: a standardized adversarial robustness benchmark Francesco Croce