Skip to yearly menu bar Skip to main content


ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals

Utkarsh Saxena ⋅ Sayeh Sharify ⋅ Kaushik Roy ⋅ Xin Wang

Abstract

Chat is not available.