Skip to yearly menu bar Skip to main content


Poster

Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation

Xinpeng Wang · Chengzhi (Martin) Hu · Paul Röttger · Barbara Plank
2025 Poster

Abstract

Video

Chat is not available.