Poster

Forge: Compiling a Unified Abstraction into Scalable Kernels for Linear Attention

Haojie Duanmu · Size Zheng · Ningxin Zheng · Jianqiao Lu · Xuegui Zheng · Xingcheng Zhang · Li-Wen Chang · Xin Liu · Dahua Lin

Abstract
