Skip to yearly menu bar Skip to main content


Poster Sat, Apr 25, 2026 • 6:30 AM – 9:00 AM PDT Pavilion 4 P4-#5215

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Chengyue Wu ⋅ Hao Zhang ⋅ Shuchen Xue ⋅ Zhijian Liu ⋅ Shizhe Diao ⋅ Ligeng Zhu ⋅ Ping Luo ⋅ Song Han ⋅ Enze Xie

Abstract

Log in and register to view live content