Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Representational Alignment

Atlas-Alignment: Making Interpretability Transferable Across Language Models

Bruno Puri ⋅ Jim Berend ⋅ Sebastian Lapuschkin ⋅ Wojciech Samek

Abstract

Chat is not available.