Poster

Taming Polysemanticity in LLMs: Theory-Grounded Feature Recovery via Sparse Autoencoders

Siyu Chen ⋅ Heejune Sheen ⋅ Xuyuan Xiong ⋅ Tianhao Wang ⋅ Zhuoran Yang

Abstract
