

GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Theory

Pepijn Cobben ⋅ Xuanqiang Angelo Huang ⋅ Thao Pham ⋅ Isabel Dahlgren ⋅ Terry Zhang ⋅ Zhijing Jin

Abstract
