firstbacksecondback
70 Results
Workshop
|
Sat 2:40 |
ADVANCING DNA LANGUAGE MODELS: THE GENOMICS LONG-RANGE BENCHMARK Chia Hsiang Kao · Evan Trop · McKinley Polen · Yair Schiff · Bernardo Almeida · Aaron Gokaslan · Thomas PIERROT · Volodymyr Kuleshov |
|
Workshop
|
WAVES: Benchmarking the Robustness of Image Watermarks Tahseen Rabbani · Bang An · Mucong Ding · Aakriti Agrawal · Yuancheng Xu · Chenghao Deng · Sicheng Zhu · Abdirisak Mohamed · Yuxin Wen · Tom Goldstein · Furong Huang |
||
Workshop
|
A BENCHMARK FOR GEOGRAPHIC DISTRIBUTION SHIFT IN SMALLHOLDER AGROFORESTRY: DO FOUNDATION MODELS IMPROVE OOD GENERALIZATION? Siddharth Sachdeva · Chandrasekhar Biradar · Isabel Lopez · David Lobell |
||
Workshop
|
How to benchmark AGI: the Adversarial Game Emmanouil Seferis |
||
Workshop
|
Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget Florian Eddie Dorner · Moritz Hardt |
||
Workshop
|
PostRainBench: A comprehensive benchmark and a new model for precipitation forecasting Yujin Tang · Jiaming Zhou · Xiang Pan · Zeying Gong · Junwei Liang |
||
Workshop
|
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution Alex Gu · Baptiste Roziere · Hugh Leather · Armando Solar-Lezama · Gabriel Synnaeve · Sida Wang |
||
Workshop
|
Sat 2:40 |
DARKIN: A zero-shot classification benchmark and an evaluation of protein language models Emine Ayşe Sunar · Zeynep Işık · Mert Pekey · Ramazan Gokberk Cinbis · Oznur Tastan |
|
Workshop
|
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design Chase Armer · Henning Redestig · Erika DeBenedictis · Hassan Kane · Dana Cortade · Peter Kelly · Adil Yusuf · TJ Brunette |
||
Workshop
|
Long-Range Synthetic Knowledge Graph Benchmarks for Double-Equivariant Models Bruna Jasinowodolinski · Yucheng Zhang · Jincheng Zhou · Beatrice Bevilacqua · Bruno Ribeiro |
||
Workshop
|
BOLAA: BENCHMARKING AND ORCHESTRATING LLM AUTONOMOUS AGENTS Zhiwei Liu · Weiran Yao · Jianguo Zhang · Le Xue · Shelby Heinecke · Rithesh Murthy · Yihao Feng · Zeyuan Chen · Juan Carlos Niebles · Devansh Arpit · Ran Xu · Phil Mui · Huan Wang · Caiming Xiong · Silvio Savarese |
||
Workshop
|
Medical Event Data Standard (MEDS): Facilitating Machine Learning for Health Bert Arnrich · Edward Choi · Jason Fries · Matthew McDermott · Jungwoo Oh · Tom Pollard · Nigam Shah · Ethan Steinberg · Michael Wornow · Robin van de Water |