Poster

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Chengyue Wu · Hao Zhang · Shuchen Xue · Zhijian Liu · Shizhe Diao · Ligeng Zhu · Ping Luo · Song Han · Enze Xie

Abstract
