Skip to yearly menu bar Skip to main content


Poster

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models

Chi Zhang ⋅ Huaping Zhong ⋅ Kuan Zhang ⋅ Chengliang Chai ⋅ Rui Wang ⋅ Xinlin Zhuang ⋅ Tianyi Bai ⋅ Qiu Jiantao ⋅ Lei Cao ⋅ Ju Fan ⋅ Ye Yuan ⋅ Guoren Wang ⋅ Conghui He
2025 Poster

Abstract

Video

Chat is not available.