Skip to yearly menu bar Skip to main content


Poster

SafeMoE: Safe Fine-Tuning for MoE LLMs by Aligning Harmful Input Routing

Jaehan Kim ⋅ Minkyoo Song ⋅ Seungwon Shin ⋅ Sooel Son

Abstract

Log in and register to view live content