Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Representational Alignment

Safe Downstream Adaptation of LLMs via Refusal-Orthogonal Gated Editing

Nayeema Nonta ⋅ Sirisha Rambhatla

Abstract

Chat is not available.