Poster

Improving Block-Wise LLM Quantization by 4-bit Block-Wise Optimal Float (BOF4): Analysis and Variations

Patrick Blumenberg · Thomas Graave · Tim Fingscheidt

Abstract
