firstbacksecondback
13 Results
Poster
|
Sat 23:45 |
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Victor Weixin Liang · Lili Yu · Liang Luo · Srini Iyer · Ning Dong · Chunting Zhou · Gargi Ghosh · Mike Lewis · Luke Zettlemoyer · Victoria Lin |