Skip to yearly menu bar Skip to main content


SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Zhangchen Xu · Fengqing Jiang · Luyao Niu · Jinyuan Jia · Bill Yuchen Lin · Radha Poovendran

Abstract

Chat is not available.