Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs

Zifei Xu · Sayeh Sharify · Wanzin Yazar · Tristan Webb · Xin Wang

Abstract
