Skip to yearly menu bar Skip to main content


RubricRobustness: Evaluating the Sensitivity of Rubrics-Based Benchmarks to Simple Perturbations

Manasi Sharma ⋅ Bradley Kenstler ⋅ Bing Liu

Abstract

Chat is not available.