Skip to yearly menu bar Skip to main content


Poster

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Junkang Wu · Yuexiang Xie · Zhengyi Yang · Jiancan Wu · Jiawei Chen · Jinyang Gao · Bolin Ding · Xiang Wang · Xiangnan He
2025 Poster

Abstract

Video

Chat is not available.