

(6 events)
Oral
Fri Apr 25 12:30 AM -- 12:42 AM (PDT) @ Peridot 204-205
Cut Your Losses in Large-Vocabulary Language Models
Erik Wijmans · Brody Huval · Alexander Hertzberg · Vladlen Koltun · Philipp Krähenbühl
Oral
Fri Apr 25 12:42 AM -- 12:54 AM (PDT) @ Peridot 204-205
Your Mixture-of-Experts LLM Is Secretly an Embedding Model for Free
Ziyue Li · Tianyi Zhou
Oral
Fri Apr 25 12:54 AM -- 01:06 AM (PDT) @ Peridot 204-205
ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
Zhengzhuo Xu · Bowen Qu · Yiyan Qi · SiNan Du · Chengjin Xu · Chun Yuan · Jian Guo
Oral
Fri Apr 25 01:06 AM -- 01:18 AM (PDT) @ Peridot 204-205
MaestroMotif: Skill Design from Artificial Intelligence Feedback
Martin Klissarov · Mikael Henaff · Roberta Raileanu · Shagun Sodhani · Pascal Vincent · Amy Zhang · Pierre-Luc Bacon · Doina Precup · Marlos C. Machado · Pierluca D'Oro
Oral
Fri Apr 25 01:18 AM -- 01:30 AM (PDT) @ Peridot 204-205
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
Peng Jin · Bo Zhu · Yuan Li · Shuicheng YAN
Oral
Fri Apr 25 01:30 AM -- 01:42 AM (PDT) @ Peridot 204-205
OLMoE: Open Mixture-of-Experts Language Models
Niklas Muennighoff · Luca Soldaini · Dirk Groeneveld · Kyle Lo · Jacob Morrison · Sewon Min · Weijia Shi · Pete Walsh · Oyvind Tafjord · Nathan Lambert · Yuling Gu · Shane Arora · Akshita Bhagia · Dustin Schwenk · David Wadden · Alexander Wettig · Binyuan Hui · Tim Dettmers · Douwe Kiela · Ali Farhadi · Noah Smith · Pang Wei Koh · Amanpreet Singh · Hanna Hajishirzi