Skip to yearly menu bar Skip to main content


Poster

Efficient Streaming Language Models with Attention Sinks

Guangxuan Xiao · Yuandong Tian · Beidi Chen · Song Han · Mike Lewis
2024 Poster

Abstract

Video

Chat is not available.