Skip to yearly menu bar Skip to main content


Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs

Zifei Xu ⋅ Sayeh Sharify ⋅ Wanzin Yazar ⋅ Tristan Webb ⋅ Xin Wang

Abstract

Chat is not available.