Skip to yearly menu bar Skip to main content


Poster

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Zhongzhu Zhou · Fengxiang Bie · Ziyan Chen · Zhenyu Zhang · Yibo Yang · Junxiong Wang · Ben Athiwaratkun · Xiaoxia (Shirley) Wu · Shuaiwen Song

Abstract

Log in and register to view live content