Skip to yearly menu bar Skip to main content


From Data to Behavior: Predicting Unintended Model Behaviors Before Training

Mengru Wang ⋅ Zhenqian Xu ⋅ Junfeng Fang ⋅ Yunzhi Yao ⋅ Shumin Deng ⋅ Huajun Chen ⋅ Ningyu Zhang

Abstract

Log in and register to view live content