Skip to yearly menu bar Skip to main content


In-Person Poster presentation / poster accept

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Bobby He ⋅ James Martens ⋅ Guodong Zhang ⋅ Aleksandar Botev ⋅ Andrew Brock ⋅ Samuel L Smith ⋅ Yee Whye Teh
2023 In-Person Poster presentation / poster accept

Abstract

Video

Chat is not available.