From Dense to Dynamic: Token-Difficulty Driven MoEfication of Pre-Trained LLMs

Kumari Nishu ⋅ Sachin Mehta ⋅ Samira Abnar ⋅ Mehrdad Farajtabar ⋅ Maxwell Horton ⋅ Mahyar Najibi ⋅ Moin Nabi ⋅ Minsik Cho ⋅ Devang Naik

Abstract
