

Subliminal Transfer of Unsafe Behaviors in AI Agent Distillation

Jacob Dang ⋅ Brian Xie ⋅ Omar G. Younis

Abstract
