Poster
What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models
Ahmed Imtiaz Humayun · Ibtihel Amara · Cristina Nader Vasconcelos · Deepak Ramachandran · Candice Schumann · Junfeng He · Katherine Heller · Golnoosh Farnadi · Negar Rostamzadeh · Mohammad Havaei
Hall 3 + Hall 2B #604
Fri 25 Apr, 7:00 p.m. – 9:30 p.m. PDT
Abstract:
Deep Generative Models are frequently used to learn continuous representations of complex data distributions from a finite number of samples. For any generative model, including pre-trained foundation models with GAN, Transformer, or Diffusion architectures, generation performance can vary significantly depending on which part of the learned data manifold is sampled. In this paper, we study the post-training local geometry of the learned manifold and its relationship to generation outcomes for models ranging from toy settings to the latent decoder of the near state-of-the-art Stable Diffusion 1.4 text-to-image model. Building on the theory of continuous piecewise-linear (CPWL) generators, we characterize the local geometry in terms of three geometric descriptors: scaling (ψ), rank (ν), and complexity (δ). We provide quantitative and qualitative evidence that, for a given latent, these local descriptors are indicative of generation aesthetics, artifacts, diversity, and memorization. Finally, we demonstrate that training a reward model on the local geometry allows us to control the log-likelihood of a generated sample under the learned distribution and to improve the qualitative aspects of an image.
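To make the descriptors concrete, below is a minimal sketch (not the authors' implementation) of how Jacobian-based local-geometry quantities can be estimated for a small CPWL (ReLU) generator. The toy network, the function `local_descriptors`, and the specific formulas are illustrative assumptions: ψ is taken as the log-volume change given by the Jacobian's singular values, ν as its numerical rank, and δ as a count of distinct ReLU activation patterns in a small latent neighborhood, a common proxy for local linear-region density. The paper's exact definitions may differ.

```python
# Illustrative sketch only: toy CPWL generator and assumed descriptor formulas.
import torch

torch.manual_seed(0)

# Toy ReLU generator standing in for a real latent decoder (hypothetical).
generator = torch.nn.Sequential(
    torch.nn.Linear(8, 64), torch.nn.ReLU(),
    torch.nn.Linear(64, 64), torch.nn.ReLU(),
    torch.nn.Linear(64, 32),
)

def local_descriptors(g, z, eps=1e-6, n_probes=64, radius=1e-2):
    """Estimate (psi, nu, delta) at latent z for an nn.Sequential ReLU generator g."""
    # Jacobian of the generator at z: output_dim x latent_dim.
    J = torch.autograd.functional.jacobian(g, z)
    s = torch.linalg.svdvals(J)
    nu = int((s > eps).sum())                  # numerical rank of the Jacobian
    psi = float(torch.log(s[s > eps]).sum())   # log-volume scaling of the local map
    # Complexity proxy: number of distinct ReLU activation patterns observed
    # among random latents in a small ball around z (local linear regions).
    patterns = set()
    with torch.no_grad():
        probes = z + radius * torch.randn(n_probes, z.shape[0])
        for p in probes:
            h, pattern = p, []
            for layer in g:
                h = layer(h)
                if isinstance(layer, torch.nn.ReLU):
                    pattern.append((h > 0).to(torch.int8))
            patterns.add(tuple(torch.cat(pattern).tolist()))
    delta = len(patterns)
    return psi, nu, delta

z = torch.randn(8)
psi, nu, delta = local_descriptors(generator, z)
print(f"scaling psi={psi:.3f}, rank nu={nu}, complexity delta={delta}")
```

Under these assumed definitions, latents whose Jacobians have larger ψ or lower ν map through locally expanded or collapsed regions of the manifold, and a higher δ indicates a denser patchwork of linear regions around the sample; the paper relates such descriptors to aesthetics, artifacts, diversity, and memorization.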