Skip to yearly menu bar Skip to main content


Poster
in
Workshop: How Far Are We From AGI

Rethinking harmless refusals when fine-tuning foundation models

Florin Pop ⋅ Judd Rosenblatt ⋅ Diogo de Lucena ⋅ Michael Vaiana

Abstract

Chat is not available.