Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Representational Alignment

Does Downstream Fine-Tuning Undo Embedded Activation Steering?

Philipp E. Glass ⋅ Allan Tucker ⋅ Yongmin Li ⋅ Alina Miron

Abstract

Chat is not available.