Skip to yearly menu bar Skip to main content


OPUS: Towards Principled and Scalable Data Selection for Large Language Model Pre-training in Every Iteration

Shaobo Wang ⋅ Xuan Ouyang ⋅ Tianyi Xu ⋅ Yuzheng Hu ⋅ Jialin Liu ⋅ Guo Chen ⋅ Tianyu Zhang ⋅ Junhao Zheng ⋅ Kexin Yang ⋅ Xingzhang Ren ⋅ Dayiheng Liu ⋅ Linfeng Zhang

Abstract

Chat is not available.