

Poster

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Heming Xia · Yongqi Li · Jun Zhang · Cunxiao Du · Wenjie Li
2025 Poster
