Skip to yearly menu bar Skip to main content


Does LLM Pre-Training Typically Occur at the Edge of Stability?

Yuhang Cai ⋅ Haofeng Huang ⋅ Haodong Wen ⋅ Deyi Liu ⋅ Yiyuan Ma ⋅ Kaifeng Lyu

Abstract

Chat is not available.