Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Representational Alignment

HOW WELL CAN A LARGE LANGUAGE MODEL INFER THE VALUE REPRESENTATIONS EXPRESSED BY ANOTHER INSTANCE OF THE SAME MODEL?

Maryam Ghorbansabagh ⋅ Amir-Hossein Karimi ⋅ Igor Grossmann

Abstract

Chat is not available.