firstbacksecondback
70 Results
Workshop
|
A Benchmark Dataset for Meteorological Downscaling Michael Langguth · Paula Harder · Irene Schicker · Ankit Patnala · Sebastian Lehner · Konrad Mayer · Markus Dabernig |
||
Workshop
|
RD2Bench: Toward Data-Centric Automatic R&D Haotian Chen · Xinjie Shen · Zeqi Ye · Xiao Yang · Xu Yang · Weiqing Liu · Jiang Bian |
||
Workshop
|
Sat 7:15 |
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data Chenhui Zhang · Sherrie Wang |
|
Workshop
|
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT Jon Saad-Falcon · Dan Fu · Simran Arora · Neel Guha · Christopher Re |
||
Poster
|
Thu 7:30 |
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy Simon Ging · Maria A. Bravo · Thomas Brox |
|
Poster
|
Thu 7:30 |
WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series Irina Rish · Kartik Ahuja · Mohammad Javad Darvishi Bayazi · Pooneh Mousavi · Guillaume Dumas · Jean-Christophe Gagnon-Audet |
|
Poster
|
Fri 1:45 |
Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data? Megan Richards · Polina Kirichenko · Diane Bouchacourt · Mark Ibrahim |
|
Affinity Workshop
|
Tue 1:45 |
An Evaluation Benchmark for Autoformalization in Lean4 Jasdeep Sidhu · Shubhra Mishra |
|
Poster
|
Wed 7:30 |
A Benchmark for Learning to Translate a New Language from One Grammar Book Garrett Tanzer · Mirac Suzgun · Eline Visser · Dan Jurafsky · Luke Melas-Kyriazi |
|
Poster
|
Tue 7:30 |
Benchmarking and Improving Generator-Validator Consistency of Language Models XIANG LI · Vaishnavi Shrivastava · Siyan Li · Tatsunori Hashimoto · Percy Liang |
|
Poster
|
Fri 1:45 |
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems Tianyang Liu · Canwen Xu · Julian McAuley |
|
Poster
|
Wed 1:45 |
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use Yue Huang · Jiawen Shi · Yuan Li · Chenrui Fan · Siyuan Wu · Qihui Zhang · Yixin Liu · Pan Zhou · Yao Wan · Neil Gong · Lichao Sun |