Skip to yearly menu bar Skip to main content


Poster

SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests

Punya Syon Pandey · Lê Sơn · Devansh Bhardwaj · Zhijing Jin

Abstract

Log in and register to view live content