Skip to yearly menu bar Skip to main content


Poster

Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Ziyue Li · Chenrui Fan · Tianyi Zhou

Abstract

Log in and register to view live content