Skip to yearly menu bar Skip to main content


Poster

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Jing Liu ⋅ Ruihao Gong ⋅ Xiuying Wei ⋅ Zhiwei Dong ⋅ Jianfei Cai ⋅ Bohan Zhuang
2024 Poster

Abstract

Video

Chat is not available.