Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Apr 24 12:30 AM -- 12:42 AM (PDT) @ Garnet 213-215 None
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Jun Shern Chan · Neil Chowdhury · Oliver Jaffe · James Aung · Dane Sherburn · Evan Mays · Giulio Starace · Kevin Liu · Leon Maksin · Tejal Patwardhan · Aleksander Madry · Lilian Weng
[ OpenReview
Oral
Thu Apr 24 12:42 AM -- 12:54 AM (PDT) @ Garnet 213-215 None
MMQA: Evaluating LLMs with Multi-Table Multi-Hop Complex Questions
Jian Wu · Linyi Yang · Dongyuan Li · Yuliang Ji · Manabu Okumura · Yue Zhang
[ OpenReview
Oral
Thu Apr 24 12:54 AM -- 01:06 AM (PDT) @ Garnet 213-215 None
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia · Siwei Han · Shi Qiu · Yiyang Zhou · Zhaoyang Wang · Wenhao Zheng · Zhaorun Chen · Chenhang Cui · Mingyu Ding · Linjie Li · Lijuan Wang · Huaxiu Yao
[ OpenReview
Oral
Thu Apr 24 01:06 AM -- 01:18 AM (PDT) @ Garnet 213-215 None
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Yue Yang · Shuibo Zhang · Kaipeng Zhang · Yi Bin · Yu Wang · Ping Luo · Wenqi Shao
[ OpenReview
Oral
Thu Apr 24 01:18 AM -- 01:30 AM (PDT) @ Garnet 213-215 None
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
Yuxuan Sun · Yunlong Zhang · Yixuan Si · Chenglu Zhu · Kai Zhang · Zhongyi Shui · Jingxiong Li · Xuan Gong · XINHENG LYU · Tao Lin · Lin Yang
[ OpenReview
Oral
Thu Apr 24 01:30 AM -- 01:42 AM (PDT) @ Garnet 213-215 None
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
Simon Schrodi · David T. Hoffmann · Max Argus · Volker Fischer · Thomas Brox
[ OpenReview