AdvBDGen: A Robust Framework for Generating Adaptive and Stealthy Backdoors in LLM Alignment Attacks

Pankayaraj Pathmanathan ⋅ Udari Sehwag ⋅ Michael-Andrei Panaitescu-Liess ⋅ Furong Huang

Abstract
