firstbacksecondback
19 Results
Poster
|
Thu 18:30 |
Extending the WILDS Benchmark for Unsupervised Adaptation Shiori Sagawa · Pang Wei Koh · Tony Lee · Irena Gao · Sang Michael Xie · Kendrick Shen · Ananya Kumar · Weihua Hu · Michihiro Yasunaga · Henrik Marklund · Sara Beery · Etienne David · Ian Stavness · Wei Guo · Jure Leskovec · Kate Saenko · Tatsunori Hashimoto · Sergey Levine · Chelsea Finn · Percy Liang |
|
Poster
|
Wed 10:30 |
GeneDisco: A Benchmark for Experimental Design in Drug Discovery Arash Mehrjou · Ashkan Soleymani · Andrew Jesson · Pascal Notin · Yarin Gal · Stefan Bauer · Patrick Schwab |
|
Poster
|
Wed 18:30 |
Benchmarking the Spectrum of Agent Capabilities Danijar Hafner |
|
Poster
|
Tue 10:30 |
Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS Benchmarks Arber Zela · Julien Niklas Siems · Lucas Zimmer · Jovita Lukasik · Margret Keuper · Frank Hutter |
|
Poster
|
Wed 18:30 |
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations Jiaheng Wei · Zhaowei Zhu · Hao Cheng · Tongliang Liu · Gang Niu · Yang Liu |
|
Poster
|
Tue 2:30 |
miniF2F: a cross-system benchmark for formal Olympiad-level mathematics Kunhao Zheng · Jesse Han · Stanislas Polu |
|
Oral
|
Wed 10:15 |
Extending the WILDS Benchmark for Unsupervised Adaptation Shiori Sagawa · Pang Wei Koh · Tony Lee · Irena Gao · Sang Michael Xie · Kendrick Shen · Ananya Kumar · Weihua Hu · Michihiro Yasunaga · Henrik Marklund · Sara Beery · Etienne David · Ian Stavness · Wei Guo · Jure Leskovec · Kate Saenko · Tatsunori Hashimoto · Sergey Levine · Chelsea Finn · Percy Liang |
|
Workshop
|
An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations Florent Bonnet · Ahmed Mazari · Thibaut Munzer · Pierre Yser · patrick gallinari |
||
Workshop
|
Improving the assessment of deep learning models in the context of drug-target interaction prediction Mirko Torrisi · Antonio De la Vega de Leon · Guillermo Climent · Remco Loos · Alejandro Panjkovich |
||
Workshop
|
An evaluation framework for the objective functions of de novo drug design benchmarks Austin Tripp · Wenlin Chen · José Miguel Hernández Lobato |
||
Workshop
|
Benchmarking Uncertainty Quantification for Protein Engineering Kevin P Greenman · Ava Soleimany · Kevin K Yang |
||
Workshop
|
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning Michael Matthews · Mikayel Samvelyan · Jack Parker-Holder · Edward Grefenstette · Tim Rocktaeschel |