Skip to yearly menu bar Skip to main content


Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models

Jialin Zhao ⋅ Yingtao Zhang ⋅ Carlo Vittorio Cannistraci

Abstract

Chat is not available.