Skip to yearly menu bar Skip to main content


Poster

Efficient Streaming Language Models with Attention Sinks

Guangxuan Xiao ⋅ Yuandong Tian ⋅ Beidi Chen ⋅ Song Han ⋅ Mike Lewis
2024 Poster

Abstract

Video

Chat is not available.