Skip to yearly menu bar Skip to main content


Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates

Hui Wei ⋅ Shenghua He ⋅ Tian Xia ⋅ Fei Liu ⋅ Andy Wong ⋅ Jingyang Lin ⋅ Mei Han

Abstract

Chat is not available.