Differentiable samplers for deep latent variable models

Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220147. doi: 10.1098/rsta.2022.0147. Epub 2023 Mar 27.

Abstract

Latent variable models are a popular class of models in statistics. Combined with neural networks to improve their expressivity, the resulting deep latent variable models have also found numerous applications in machine learning. A drawback of these models is that their likelihood function is intractable, so approximations have to be carried out to perform inference. A standard approach is to maximize instead an evidence lower bound (ELBO), obtained from a variational approximation of the posterior distribution of the latent variables. The standard ELBO can, however, be a very loose bound if the variational family is not rich enough. A generic strategy to tighten such bounds is to rely on an unbiased low-variance Monte Carlo estimate of the evidence. We review here some recent importance sampling, Markov chain Monte Carlo and sequential Monte Carlo strategies that have been proposed to achieve this. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
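
The bound-tightening idea in the abstract can be illustrated with a minimal sketch. It is not taken from the article. It assumes a toy model chosen so that the evidence is known in closed form: a prior z ~ N(0, 1) and a likelihood x | z ~ N(z, 1), which give p(x) = N(x; 0, 2). The variational distribution is deliberately set to the prior, a poor match for the posterior, and the function names (log_normal, importance_weighted_bound) are hypothetical. Averaging K importance weights gives an unbiased estimate of the evidence, and by Jensen's inequality the expectation of its logarithm is a lower bound on log p(x). With K = 1 this is the standard ELBO, and the bound becomes tighter as the variance of the estimate decreases with larger K.

```python
import numpy as np


def log_normal(x, mean, var):
    """Log density of a univariate Gaussian N(mean, var) evaluated at x."""
    return -0.5 * (np.log(2.0 * np.pi * var) + (x - mean) ** 2 / var)


def importance_weighted_bound(x, q_mean, q_var, num_particles, num_repeats, rng):
    """Monte Carlo estimate of E[log((1/K) * sum_k w_k)], with w_k = p(x, z_k) / q(z_k).

    Assumed toy model (not from the article): z ~ N(0, 1), x | z ~ N(z, 1).
    The proposal is q(z) = N(q_mean, q_var) and K = num_particles.
    """
    # One row per independent replicate, one column per particle.
    noise = rng.standard_normal((num_repeats, num_particles))
    z = q_mean + np.sqrt(q_var) * noise
    # Log importance weights: log prior + log likelihood - log proposal.
    log_w = log_normal(z, 0.0, 1.0) + log_normal(x, z, 1.0) - log_normal(z, q_mean, q_var)
    # Numerically stable log of the mean weight in each row (log-mean-exp).
    row_max = log_w.max(axis=1, keepdims=True)
    log_evidence_hat = row_max[:, 0] + np.log(np.mean(np.exp(log_w - row_max), axis=1))
    # Average over replicates to approximate the expectation defining the bound.
    return log_evidence_hat.mean()


rng = np.random.default_rng(0)
x = 1.5

# Exact log evidence for the toy model: p(x) = N(x; 0, 2).
log_evidence = log_normal(x, 0.0, 2.0)

# K = 1 recovers the standard ELBO; K = 50 uses a lower-variance evidence estimate.
elbo = importance_weighted_bound(x, 0.0, 1.0, num_particles=1, num_repeats=20000, rng=rng)
tighter = importance_weighted_bound(x, 0.0, 1.0, num_particles=50, num_repeats=20000, rng=rng)

print("exact log evidence:", log_evidence)
print("bound with K = 1  :", elbo)
print("bound with K = 50 :", tighter)
```

In this sketch only the number of particles changes between the two bounds. The Markov chain Monte Carlo and sequential Monte Carlo strategies mentioned in the abstract are other ways of constructing the evidence estimate and are not shown here.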

Keywords: Bayesian inference; Monte Carlo methods; importance sampling; variational inference.

Publication types

  • Review