LLM Neurosurgeon: Targeted Knowledge Removal in LLMs using Sparse Autoencoders

Dylan Zhou ⋅ Kunal Patil ⋅ Yifan Sun ⋅ Karthik Lakshmanan ⋅ Senthooran Rajamanoharan ⋅ Arthur Conmy

Abstract
