Skip to yearly menu bar Skip to main content


AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization

Wanqi Yang ⋅ Yuexiao Ma ⋅ Alexander Conzelmann ⋅ Xiawu Zheng ⋅ Michael W Mahoney ⋅ T. Konstantin Rusch ⋅ Shiwei Liu

Abstract

Chat is not available.