Skip to yearly menu bar Skip to main content


Poster
in
Workshop: AI for Peace

SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests

Punya Syon Pandey ⋅ Lê Sơn ⋅ Devansh Bhardwaj ⋅ Rada Mihalcea ⋅ Zhijing Jin

Abstract

Chat is not available.