Poster

The Effect of Attention Head Count on Transformer Approximation

Penghao Yu · Haotian Jiang · Zeyu Bao · Ruoxi Yu · Qianxiao Li

Abstract
