Skip to yearly menu bar Skip to main content


Poster

Real-Time Video Generation with Pyramid Attention Broadcast

Xuanlei Zhao · Xiaolong Jin · Kai Wang · Yang You

Hall 3 + Hall 2B #617
[ ]
Fri 25 Apr midnight PDT — 2:30 a.m. PDT

Abstract:

We present Pyramid Attention Broadcast (PAB), a real-time, high quality and training-free approach for DiT-based video generation. Our method is founded on the observation that attention difference in the diffusion process exhibits a U-shaped pattern, indicating significant redundancy. We mitigate this by broadcasting attention outputs to subsequent steps in a pyramid style. It applies different broadcast strategies to each attention based on their variance for best efficiency. We further introduce broadcast sequence parallel for more efficient distributed inference. PAB demonstrates up to 10.5x speedup across three models compared to baselines, achieving real-time generation for up to 720p videos. We anticipate that our simple yet effective method will serve as a robust baseline and facilitate future research and application for video generation.

Live content is unavailable. Log in and register to view live content