Skip to yearly menu bar Skip to main content


Poster

Attention Sinks and Compression Valleys in LLMs are Two Sides of the Same Coin

Enrique Queipo-de-Llano · Alvaro Arroyo · Federico Barbero · Xiaowen Dong · Michael Bronstein · Yann LeCun · Ravid Shwartz-Ziv

Abstract

Log in and register to view live content