Poster

Probability Distributions Computed by Hard-Attention Transformers

Andy Yang · Anej Svete · Jiaoda Li · Anthony W. Lin · Jonathan Rawski · Ryan Cotterell · David Chiang

Abstract
