Poster

Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts

Shwai He · Weilin Cai · Jiayi Huang · Ang Li

Abstract
