Skip to yearly menu bar Skip to main content


Poster
in
Workshop: How Far Are We From AGI

Rethinking harmless refusals when fine-tuning foundation models

Florin Pop · Judd Rosenblatt · Diogo de Lucena · Michael Vaiana

Abstract

Chat is not available.