Skip to yearly menu bar Skip to main content


Poster

Language models scale reliably with over-training and on downstream tasks

Samir Yitzhak Gadre · Georgios Smyrnis · Vaishaal Shankar · Suchin Gururangan · Mitchell Wortsman · Rulin Shao · Jean Mercat · Alex Fang · Jeffrey Li · Sedrick Keh · Rui Xin · Marianna Nezhurina · Igor Vasiljevic · Luca Soldaini · Jenia Jitsev · Alex Dimakis · Gabriel Ilharco · Pang Wei Koh · Shuran Song · Thomas Kollar · Yair Carmon · Achal Dave · Reinhard Heckel · Niklas Muennighoff · Ludwig Schmidt
2025 Poster

Abstract

Video

Chat is not available.