Same Question, Different Lies: Cross-Context Consistency (C³) for Black-Box Sandbagging Detection

Lin Yulong ⋅ Pablo Bernabeu-Perez ⋅ Benjamin Arnav ⋅ Lennie Wells ⋅ Mary Phuong

Abstract
