Skip to yearly menu bar Skip to main content


Oral Fri, Apr 24, 2026 • 11:51 AM – 12:01 PM PDT 204 A/B

Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training

Pierre-Carl Langlais ⋅ Pavel Chizhov ⋅ Catherine Arnett ⋅ Carlos Hinostroza ⋅ Mattia Nee ⋅ Eliot Jones ⋅ Irène Girard ⋅ David Mach ⋅ Anastasia Stasenko ⋅ Ivan Yamshchikov

Abstract

Log in and register to view live content