Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Fri Apr 24 06:30 AM -- 06:40 AM (PDT) @ 204 A/B None
The Polar Express: Optimal Matrix Sign Methods and their Application to the Muon Algorithm
Noah Amsel ⋅ David Persson ⋅ Christopher Musco ⋅ Robert M. Gower
[ OpenReview
Oral
Fri Apr 24 06:42 AM -- 06:52 AM (PDT) @ 204 A/B None
Temporal superposition and feature geometry of RNNs under memory demands
Pratyaksh Sharma ⋅ Alexandra M Proca ⋅ Lucas Prieto ⋅ Pedro Mediano
[ OpenReview
Oral
Fri Apr 24 06:54 AM -- 07:04 AM (PDT) @ 204 A/B None
Scaling Laws and Spectra of Shallow Neural Networks in the Feature Learning Regime
Leonardo Defilippis ⋅ Yizhou Xu ⋅ Julius Girardin ⋅ Vittorio Erba ⋅ Emanuele Troiani ⋅ Lenka Zdeborova ⋅ Bruno Loureiro ⋅ Florent Krzakala
[ OpenReview
Oral
Fri Apr 24 07:06 AM -- 07:16 AM (PDT) @ 204 A/B None
Efficient Resource-Constrained Training of Transformers via Subspace Optimization
Le-Trung Nguyen ⋅ Enzo Tartaglione ⋅ Van-Tam Nguyen
[ OpenReview
Oral
Fri Apr 24 07:18 AM -- 07:28 AM (PDT) @ 204 A/B None
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
Haiquan Qiu ⋅ Quanming Yao
[ OpenReview
Oral
Fri Apr 24 07:30 AM -- 07:40 AM (PDT) @ 204 A/B None
HATSolver: Learning Gröbner Bases with Hierarchical Attention Transformers
Mohamed Malhou ⋅ Ludovic Perret ⋅ Kristin Lauter
[ OpenReview