Skip to yearly menu bar Skip to main content


Poster Thu, Apr 23, 2026 • 6:30 AM – 9:00 AM PDT

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Yuchen Yan · Jin Jiang · Zhenbang Ren · Yijun Li · Xudong Cai · Yang Liu · Xin Xu · Mengdi Zhang · Jian Shao · Yongliang Shen · Jun Xiao · Yueting Zhuang

Abstract

Log in and register to view live content