Skip to yearly menu bar Skip to main content


Poster

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko ⋅ Alexandra Souly ⋅ Mateusz Dziemian ⋅ Derek Duenas ⋅ Maxwell Lin ⋅ Justin Wang ⋅ Dan Hendrycks ⋅ Andy Zou ⋅ Zico Kolter ⋅ Matt Fredrikson ⋅ Yarin Gal ⋅ Xander Davies
2025 Poster

Abstract

Video

Chat is not available.