SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging

Aladin Djuhera ⋅ Swanand Kadhe ⋅ Farhan Ahmed ⋅ Syed Zawad ⋅ Holger Boche

Abstract
