Skip to yearly menu bar Skip to main content


Poster

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

Andy K Zhang · Neil Perry · Riya Dulepet · Joey Ji · Celeste Menders · Justin Lin · Eliot Jones · Gashon Hussein · Samantha Liu · Donovan Jasper · Pura Peetathawatchai · Ari Glenn · Vikram Sivashankar · Daniel Zamoshchin · Leo Glikbarg · Derek Askaryar · Haoxiang Yang · Aolin Zhang · Rishi Alluri · Nathan Tran · Rinnara Sangpisit · Kenny Oseleononmen · Dan Boneh · Daniel Ho · Percy Liang
2025 Poster

Abstract

Video

Chat is not available.