Poster in Workshop: ReALM-GEN: Real-World Constrained and Preference-Aligned Flow- and Diffusion-based Generative Models
Mon, Apr 27, 2026 • 12:00 PM – 12:50 PM PDT

Adaptive Order Policies for Masked Diffusion

Mohsin Hasan ⋅ Jama Hussein Mohamud ⋅ Mirco Ravanelli ⋅ Yoshua Bengio


Abstract

Masked diffusion models have seen great success in capturing data distributions over discrete sequences in domains such as text and proteins. These models generate data by iteratively unmasking tokens starting from a fully masked sequence, with the unmasking order typically chosen at random or using a heuristic based on denoiser probabilities. In this work, we propose a scheme for learning the unmasking order using an additional lightweight policy network on top of an existing diffusion model. Our proposed loss reweights terms in the masked diffusion loss according to policy probabilities, and results in a policy that prefers positions where the denoiser is more likely to be correct. We demonstrate that our approach outperforms common heuristics on problems that are sensitive to token ordering, such as Sudoku and Boolean satisfiability (3-SAT).
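
To make the role of the unmasking order concrete, the following is a minimal sketch of the iterative unmasking loop with the two baseline orderings the abstract mentions: random selection and a heuristic based on denoiser probabilities. It is not the authors' implementation. `toy_denoiser`, `random_order`, `max_confidence_order`, and `unmask` are hypothetical names chosen for illustration, and the toy denoiser is a hand-written stand-in for a trained model. Under these assumptions, a learned policy would plug in as another `choose` function that scores the masked positions.

```python
import random

MASK = None  # placeholder marking a still-masked position


def toy_denoiser(seq, vocab_size):
    """Hypothetical stand-in for a trained denoiser.

    Returns one probability distribution over the vocabulary per position.
    The peak probability grows with the position index, so later positions
    are the ones this toy model is most confident about.
    """
    probs = []
    for i in range(len(seq)):
        peak = 0.5 + 0.4 * i / max(1, len(seq) - 1)
        rest = (1.0 - peak) / (vocab_size - 1)
        probs.append([peak if v == i % vocab_size else rest
                      for v in range(vocab_size)])
    return probs


def random_order(masked, probs, rng):
    """Baseline: pick the next position uniformly at random."""
    return rng.choice(masked)


def max_confidence_order(masked, probs, rng):
    """Heuristic: pick the masked position with the highest top probability."""
    return max(masked, key=lambda i: max(probs[i]))


def unmask(length, vocab_size, choose, seed=0):
    """Start fully masked and reveal one token per step.

    `choose` decides which masked position to reveal next; the token itself
    is sampled from the denoiser's distribution at that position.
    """
    rng = random.Random(seed)
    seq = [MASK] * length
    order = []
    while any(tok is MASK for tok in seq):
        probs = toy_denoiser(seq, vocab_size)
        masked = [i for i, tok in enumerate(seq) if tok is MASK]
        pos = choose(masked, probs, rng)
        seq[pos] = rng.choices(range(vocab_size), weights=probs[pos])[0]
        order.append(pos)
    return seq, order


if __name__ == "__main__":
    _, conf_order = unmask(6, 4, max_confidence_order)
    _, rand_order = unmask(6, 4, random_order)
    print("max-confidence order:", conf_order)
    print("random order:", rand_order)
```

The sampling loop is the same in both cases; only the position-selection rule changes. That is the slot a policy network would occupy on ordering-sensitive problems such as Sudoku or 3-SAT.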
