Skip to yearly menu bar Skip to main content


MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Guijin Son ⋅ Dongkeun Yoon ⋅ Juyoung Suk ⋅ Javier Aula-Blasco ⋅ Mano Aslan ⋅ Kim Vu ⋅ Shayekh Islam ⋅ Jaume Prats-Cristià ⋅ Lucía Tormo-Bañuelos ⋅ Seungone Kim

Abstract

Chat is not available.