Skip to yearly menu bar Skip to main content


SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Zhangchen Xu ⋅ Fengqing Jiang ⋅ Luyao Niu ⋅ Jinyuan Jia ⋅ Bill Yuchen Lin ⋅ Radha Poovendran

Abstract

Chat is not available.