Skip to yearly menu bar Skip to main content


Simple linear attention language models balance the recall-throughput tradeoff

Simran Arora ⋅ Sabri Eyuboglu ⋅ Michael Zhang ⋅ Aman Timalsina ⋅ Silas Alberti ⋅ James Y Zou ⋅ Atri Rudra ⋅ Christopher Re

Abstract

Chat is not available.