Poster

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training

Hong Liu ⋅ Zhiyuan Li ⋅ David Hall ⋅ Percy Liang ⋅ Tengyu Ma
2024 Poster

Abstract
