Poster

The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models

Yan Liu ⋅ Yu Liu ⋅ Xiaokang Chen ⋅ Pin-Yu Chen ⋅ Daoguang Zan ⋅ Min-Yen Kan ⋅ Tsung-Yi Ho
2024 Poster
