Skip to yearly menu bar Skip to main content


Scaling-Law Analysis of SignSGD: From Feature-Space Linear Regression to LLM Pre-training

Zilin Wang ⋅ Binghui Li ⋅ Lean Wang ⋅ Jianan Wang ⋅ Jinbo Wang ⋅ Lei Wu

Abstract

Chat is not available.