Understanding the Limitations of Variational Mutual Information Estimators

Jiaming Song; Stefano Ermon

Understanding the Limitations of Variational Mutual Information Estimators

Jiaming Song, Stefano Ermon

Keywords: mutual information, variance reduction

Abstract Paper Code Reviews

Abstract: Variational approaches based on neural networks are showing promise for estimating mutual information (MI) between high dimensional variables. However, they can be difficult to use in practice due to poorly understood bias/variance tradeoffs. We theoretically show that, under some conditions, estimators such as MINE exhibit variance that could grow exponentially with the true amount of underlying MI. We also empirically demonstrate that existing estimators fail to satisfy basic self-consistency properties of MI, such as data processing and additivity under independence. Based on a unified perspective of variational approaches, we develop a new estimator that focuses on variance reduction. Empirical results on standard benchmark tasks demonstrate that our proposed estimator exhibits improved bias-variance trade-offs on standard benchmark tasks.

Understanding the Limitations of Variational Mutual Information Estimators

Jiaming Song, Stefano Ermon

Similar Papers

Estimating Gradients for Discrete Random Variables by Sampling without Replacement

Wouter Kool, Herke van Hoof, Max Welling,

Mutual Information Gradient Estimation for Representation Learning

Liangjian Wen, Yiji Zhou, Lirong He, Mingyuan Zhou, Zenglin Xu,

SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models

Yucen Luo, Alex Beatson, Mohammad Norouzi, Jun Zhu, David Duvenaud, Ryan P. Adams, Ricky T. Q. Chen,