

Poster

Hessian-Enhanced Token Attribution (HETA): Interpreting Autoregressive LLMs

Vishal Pramanik · Maisha Maliha · Nathaniel Bastian · Sumit Jha

Abstract
