
Poster

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Noam Razin ⋅ Sadhika Malladi ⋅ Adithya Bhaskar ⋅ Danqi Chen ⋅ Sanjeev Arora ⋅ Boris Hanin
2025 Poster
