Skip to yearly menu bar Skip to main content


Poster

Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models

Wenxuan Zhang · Philip Torr · Mohamed Elhoseiny · Adel Bibi
2025 Poster

Abstract

Video

Chat is not available.