Skip to yearly menu bar Skip to main content


ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Yibo Yan ⋅ Shen Wang ⋅ Jiahao Huo ⋅ Hang Li ⋅ BOYAN LI ⋅ Jiamin Su ⋅ Xiong Gao ⋅ YiFan Zhang ⋅ Tianlong Xu ⋅ Zhendong Chu ⋅ Aoxiao Zhong ⋅ Kun Wang ⋅ Hui Xiong ⋅ Philip Yu ⋅ Xuming Hu ⋅ Qingsong Wen

Abstract

Chat is not available.