Poster in Workshop: Representational Alignment

Steered LLM Activations are Non-Surjective

Aayush Mishra ⋅ Daniel Khashabi ⋅ Anqi Liu

Abstract
