ICLR Poster Learning to Discretize Denoising Diffusion ODEs

Learning to Discretize Denoising Diffusion ODEs

Vinh Tong · Trung-Dung Hoang · Anji Liu · Guy Van den Broeck · Mathias Niepert

Hall 3 + Hall 2B #192

Thu 24 Apr midnight PDT — 2:30 a.m. PDT

Oral presentation: Oral Session 1C
Wed 23 Apr 7:30 p.m. PDT — 9 p.m. PDT

Abstract:

Diffusion Probabilistic Models (DPMs) are generative models showing competitive performance in various domains, including image synthesis and 3D point cloud generation. Sampling from pre-trained DPMs involves multiple neural function evaluations (NFEs) to transform Gaussian noise samples into images, resulting in higher computational costs compared to single-step generative models such as GANs or VAEs. Therefore, reducing the number of NFEs while preserving generation quality is crucial. To address this, we propose LD3, a lightweight framework designed to learn the optimal time discretization for sampling. LD3 can be combined with various samplers and consistently improves generation quality without having to retrain resource-intensive neural networks. We demonstrate analytically and empirically that LD3 improves sampling efficiency with much less computational overhead. We evaluate our method with extensive experiments on 7 pre-trained models, covering unconditional and conditional sampling in both pixel-space and latent-space DPMs. We achieve FIDs of 2.38 (10 NFE) and 2.27 (10 NFE) on unconditional CIFAR10 and AFHQv2, respectively, in 5-10 minutes of training. LD3 offers an efficient approach to sampling from pre-trained diffusion models. Code is available at https://github.com/vinhsuhi/LD3.
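
To make the idea of learning a time discretization concrete, the following is a minimal toy sketch and not the LD3 implementation. It assumes a 1-D Gaussian "dataset" whose probability-flow ODE has a closed-form solution (used here as a stand-in for a many-step teacher sampler), a plain Euler solver as the few-step student, and a softmax parametrization that keeps the time steps ordered. All names (`timesteps`, `learn_grid`, `S0`, `T_MAX`, `T_MIN`) and the finite-difference optimizer are hypothetical choices for illustration; the abstract does not specify the actual objective or parametrization.

```python
import math

S0 = 0.5                   # std of the toy 1-D Gaussian "data" (assumption)
T_MAX, T_MIN = 10.0, 0.01  # noise-level range of the toy problem (assumption)

def drift(x, t):
    """dx/dt of the probability-flow ODE for N(0, S0^2) data with sigma(t) = t."""
    return t * x / (S0 ** 2 + t ** 2)

def timesteps(logits):
    """Map unconstrained logits to a strictly decreasing grid from T_MAX to T_MIN."""
    peak = max(logits)
    weights = [math.exp(v - peak) for v in logits]  # softmax numerators
    total = sum(weights)
    hi, lo = math.log(T_MAX), math.log(T_MIN)
    grid, used = [T_MAX], 0.0
    for w in weights:
        used += w / total  # cumulative fraction of the log-time range
        grid.append(math.exp(hi - used * (hi - lo)))
    return grid

def euler_sample(x, grid):
    """Few-step student: one drift evaluation (one 'NFE') per grid interval."""
    for t_cur, t_next in zip(grid[:-1], grid[1:]):
        x += (t_next - t_cur) * drift(x, t_cur)
    return x

def teacher_sample(x):
    """Closed-form ODE solution, standing in for a many-step teacher sampler."""
    return x * math.sqrt((S0 ** 2 + T_MIN ** 2) / (S0 ** 2 + T_MAX ** 2))

def loss(logits, x_start=T_MAX):
    """Squared relative gap between the student output and the teacher output."""
    target = teacher_sample(x_start)
    return ((euler_sample(x_start, timesteps(logits)) - target) / target) ** 2

def learn_grid(num_steps=5, iters=200, eps=1e-5):
    """Finite-difference gradient descent with a backtracking line search."""
    logits = [0.0] * num_steps  # start from a uniform grid in log-time
    best = loss(logits)
    for _ in range(iters):
        grad = []
        for i in range(num_steps):
            up = logits[:i] + [logits[i] + eps] + logits[i + 1:]
            down = logits[:i] + [logits[i] - eps] + logits[i + 1:]
            grad.append((loss(up) - loss(down)) / (2 * eps))
        norm = math.sqrt(sum(g * g for g in grad))
        if norm == 0.0:
            break
        step = 1.0
        while step > 1e-8:
            trial = [v - step * g / norm for v, g in zip(logits, grad)]
            trial_loss = loss(trial)
            if trial_loss < best:  # accept only steps that reduce the loss
                logits, best = trial, trial_loss
                break
            step *= 0.5
        else:
            break  # no improving step found; treat as converged
    return logits

uniform_loss = loss([0.0] * 5)
learned_logits = learn_grid()
learned_loss = loss(learned_logits)
print("uniform grid loss:", uniform_loss)
print("learned grid loss:", learned_loss)
print("learned time steps:", timesteps(learned_logits))
```

In this sketch only the handful of grid logits are optimized while the drift function (the analogue of the pre-trained network) stays fixed, which mirrors the abstract's point that the discretization can be learned without retraining the model. A real setting would replace the closed-form teacher with a high-NFE sampler and the finite differences with automatic differentiation.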