Skip to yearly menu bar Skip to main content


Oral Fri, Apr 24, 2026 • 7:18 AM – 7:28 AM PDT 204 A/B

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Haiquan Qiu ⋅ Quanming Yao

Abstract

Video

Chat is not available.