Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Victor Weixin Liang ⋅ Lili Yu ⋅ Liang Luo ⋅ Srini Iyer ⋅ Ning Dong ⋅ Chunting Zhou ⋅ Gargi Ghosh ⋅ Mike Lewis ⋅ Luke Zettlemoyer ⋅ Victoria Lin
2025 Oral