Skip to yearly menu bar Skip to main content


Poster

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

Sijun Tan ⋅ Siyuan Zhuang ⋅ Kyle Montgomery ⋅ William Tang ⋅ Alejandro Cuadron ⋅ Chenguang Wang ⋅ Raluca Popa ⋅ Ion Stoica
2025 Poster

Abstract

Video

Chat is not available.