

Why Do Language Model Agents Whistleblow?

Kushal Agrawal ⋅ Frank Xiao ⋅ Guido Bergman ⋅ Asa Stickland

Abstract
