Auditing Preference-Based Post-Training of LLMs via Strong Membership Inference Attacks

Lorenzo Rossi ⋅ Kaif A Shaikh ⋅ Franziska Boenisch ⋅ Adam Dziedzic

Abstract