ICLR Poster Instant Policy: In-Context Imitation Learning via Graph Diffusion

Poster

Instant Policy: In-Context Imitation Learning via Graph Diffusion

Vitalis Vosylius · Edward Johns

Hall 3 + Hall 2B #39

[ Abstract ] [ Project Page ]

Sat 26 Apr midnight PDT — 2:30 a.m. PDT

Oral presentation: Oral Session 5F
Fri 25 Apr 7:30 p.m. PDT — 9 p.m. PDT

Abstract:

Following the impressive capabilities of in-context learning with large transformers, In-Context Imitation Learning (ICIL) is a promising opportunity for robotics. We introduce Instant Policy, which learns new tasks instantly from just one or two demonstrations, achieving ICIL through two key components. First, we introduce inductive biases through a graph representation and model ICIL as a graph generation problem using a learned diffusion process, enabling structured reasoning over demonstrations, observations, and actions. Second, we show that such a model can be trained using pseudo-demonstrations – arbitrary trajectories generated in simulation – as a virtually infinite pool of training data. Our experiments, in both simulation and reality, show that Instant Policy enables rapid learning of various everyday robot tasks. We also show how it can serve as a foundation for cross-embodiment and zero-shot transfer to language-defined tasks.

Live content is unavailable. Log in and register to view live content