Skip to yearly menu bar Skip to main content


Oral #4: Understanding the Difficulty of Low-Precision Post-Training Quantization for LLMs

Xin Wang

Abstract

Video

Chat is not available.