Skip to yearly menu bar Skip to main content


Poster

Learning to Describe Scenes with Programs

Yunchao Liu · Zheng Wu · Daniel Ritchie · William Freeman · Joshua B Tenenbaum · Jiajun Wu

Great Hall BC #72

Keywords: [ program synthesis ] [ structured scene representations ]


Abstract:

Human scene perception goes beyond recognizing a collection of objects and their pairwise relations. We understand higher-level, abstract regularities within the scene such as symmetry and repetition. Current vision recognition modules and scene representations fall short in this dimension. In this paper, we present scene programs, representing a scene via a symbolic program for its objects, attributes, and their relations. We also propose a model that infers such scene programs by exploiting a hierarchical, object-based scene representation. Experiments demonstrate that our model works well on synthetic data and transfers to real images with such compositional structure. The use of scene programs has enabled a number of applications, such as complex visual analogy-making and scene extrapolation.

Live content is unavailable. Log in and register to view live content