Poster

SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

Geon-Hyeong Kim · Youngsoo Jang · Yu Jin Kim · Byoungjip Kim · Honglak Lee · Kyunghoon Bae · Moontae Lee

Abstract
