

Poster

On the Role of Attention Heads in Large Language Model Safety

Zhenhong Zhou ⋅ Haiyang Yu ⋅ Xinghua Zhang ⋅ Rongwu Xu ⋅ Fei Huang ⋅ Kun Wang ⋅ Yang Liu ⋅ Junfeng Fang ⋅ Yongbin Li
2025 Poster

Abstract
