Skip to yearly menu bar Skip to main content


Poster

MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation

Zhongshen Zeng · Pengguang Chen · Shu Liu · Haiyun Jiang · Jiaya Jia
2025 Poster

Abstract

Video

Chat is not available.