Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Victor Weixin Liang · Lili Yu · Liang Luo · Srini Iyer · Ning Dong · Chunting Zhou · Gargi Ghosh · Mike Lewis · Luke Zettlemoyer · Victoria Lin
2025 Oral
Chat is not available.
Successful Page Load