In-Person Poster presentation / top 25% paper

Mass-Editing Memory in a Transformer

Kevin Meng · Arnab Sen Sharma · Alex J Andonian · Yonatan Belinkov · David Bau

MH1-2-3-4 #33

Keywords: [ Applications ] [ memory ] [ transformers ] [ language models ] [ model editing ] [ GPT ] [ factual associations ]


Abstract:

Recent work has shown exciting promise in updating large language models with new memories, so as to replace obsolete information or add specialized knowledge. However, this line of work is predominantly limited to updating single associations. We develop MEMIT, a method for directly updating a language model with many memories, demonstrating experimentally that it can scale up to thousands of associations for GPT-J (6B) and GPT-NeoX (20B), exceeding prior work by an order of magnitude. Our code and data will be open-sourced upon publication.
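
To make the idea of "updating a model with many memories" concrete, the sketch below shows the kind of closed-form, batched least-squares update to a single linear layer that this family of editing methods builds on: new key-value associations are stored while perturbation of previously served keys is penalized. It is a toy, single-layer illustration; the dimensions, the normalization, and the preservation weight `lam` are illustrative assumptions, not the paper's actual multi-layer MEMIT procedure.

```python
# Toy sketch: storing many new key->value "memories" in one linear layer
# via a closed-form least-squares update. Illustrative only; not the
# paper's exact MEMIT algorithm, which spreads updates over several layers.
import numpy as np

rng = np.random.default_rng(0)
d_k, d_v, n_old, n_new = 64, 32, 1000, 50

W = rng.standard_normal((d_v, d_k)) * 0.1   # existing layer weights
K_old = rng.standard_normal((d_k, n_old))   # keys the layer already handles
K_new = rng.standard_normal((d_k, n_new))   # keys of the new associations
V_new = rng.standard_normal((d_v, n_new))   # values to store for those keys

# Solve  min_dW  lam * ||dW K_old||^2 + ||(W + dW) K_new - V_new||^2 :
# fit the new associations while disturbing prior keys as little as
# possible. Setting the gradient to zero gives a closed-form solution.
lam = 0.1                                   # preservation strength (assumed)
C = K_old @ K_old.T / n_old                 # covariance of prior keys
R = V_new - W @ K_new                       # residual error on the new keys
dW = R @ K_new.T @ np.linalg.inv(lam * C + K_new @ K_new.T)

W_edited = W + dW
new_err = np.linalg.norm(W_edited @ K_new - V_new) / np.linalg.norm(V_new)
old_drift = np.linalg.norm(dW @ K_old) / np.linalg.norm(W @ K_old)
print(f"error on new memories: {new_err:.3f}, drift on old keys: {old_drift:.3f}")
```

Because the update is computed in closed form rather than by per-example gradient descent, a single solve can insert a large batch of associations at once, which is what lets this style of editing scale to thousands of edits.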
