Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Fri Apr 24 11:15 AM -- 11:25 AM (PDT) @ 204 A/B None
OpenThoughts: Data Recipes for Reasoning Models
Etash Guha ⋅ Ryan Marten ⋅ Sedrick Keh ⋅ Negin Raoof ⋅ Georgios Smyrnis ⋅ Hritik Bansal ⋅ Marianna Nezhurina ⋅ Jean Mercat ⋅ Trung Vu ⋅ Zayne Sprague ⋅ Ashima Suvarna ⋅ Benjamin Feuer ⋅ Leon Liangyu Chen ⋅ Zaid Khan ⋅ Eric Frankel ⋅ Sachin Grover ⋅ Caroline Choi ⋅ Niklas Muennighoff ⋅ Shiye Su ⋅ Wanjia Zhao ⋅ John Yang ⋅ Shreyas Pimpalgaonkar ⋅ Kartik sharma ⋅ Charlie Ji ⋅ Yichuan Deng ⋅ Sarah Pratt ⋅ Vivek Ramanujan ⋅ Jon Saad-Falcon ⋅ Stutee Acharya ⋅ Jeffrey Li ⋅ Achal Dave ⋅ Alon Albalak ⋅ Kushal Arora ⋅ Blake Wulfe ⋅ Chinmay Hegde ⋅ Greg Durrett ⋅ Sewoong Oh ⋅ Mohit Bansal ⋅ Saadia Gabriel ⋅ Aditya Grover ⋅ Kai-Wei Chang ⋅ Vaishaal Shankar ⋅ Aaron Gokaslan ⋅ Mike Merrill ⋅ Tatsunori Hashimoto ⋅ Yejin Choi ⋅ Jenia Jitsev ⋅ Reinhard Heckel ⋅ Maheswaran Sathiamoorthy ⋅ Alex Dimakis ⋅ Ludwig Schmidt
[ OpenReview
Oral
Fri Apr 24 11:27 AM -- 11:37 AM (PDT) @ 204 A/B None
FRABench and UFEval: Unified Fine-grained Evaluation with Task and Aspect Generalization
Shibo Hong ⋅ jiahao ying ⋅ Haiyuan Liang ⋅ Mengdi Zhang ⋅ Jun Kuang ⋅ Jiazheng Zhang ⋅ Yixin Cao
[ OpenReview
Oral
Fri Apr 24 11:39 AM -- 11:49 AM (PDT) @ 204 A/B None
SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents
Gyuhyeon Seo ⋅ Jungwoo Yang ⋅ Junseong Pyo ⋅ Nalim Kim ⋅ Jonggeun Lee ⋅ Yohan Jo
[ Slides [ OpenReview
Oral
Fri Apr 24 11:51 AM -- 12:01 PM (PDT) @ 204 A/B None
Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training
Pierre-Carl Langlais ⋅ Pavel Chizhov ⋅ Catherine Arnett ⋅ Carlos Hinostroza ⋅ Mattia Nee ⋅ Eliot Jones ⋅ Irène Girard ⋅ David Mach ⋅ Anastasia Stasenko ⋅ Ivan Yamshchikov
[ OpenReview