Skip to yearly menu bar Skip to main content


Poster

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron

Yiran Zhao · Wenxuan Zhang · Yuxi Xie · Anirudh Goyal · Kenji Kawaguchi · Michael Qizhe Shieh
2025 Poster

Abstract

Video

Chat is not available.