Skip to yearly menu bar Skip to main content


Fast Gradient Computation for RoPE Attention in Almost Linear Time

Yifang Chen ⋅ Jiayan Huo ⋅ Xiaoyu Li ⋅ Yingyu Liang ⋅ Zhenmei Shi ⋅ Zhao Song

Abstract

Video

Chat is not available.