Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Representational Alignment

On the Identifiability of Steering Vectors in Large Language Models

Sohan Venkatesh ⋅ Ashish Kurapath

Abstract

Chat is not available.