RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning

Zeming Wei ⋅ Qiaosheng Zhang ⋅ Xia Hu ⋅ Xingcheng Xu

Abstract
