Bayesian Comparison of Latent Variable Models: Conditional Versus Marginal Likelihoods

Psychometrika. 2019 Sep;84(3):802-829. doi: 10.1007/s11336-019-09679-0. Epub 2019 Jul 11.

Abstract

Typical Bayesian methods for models with latent variables (or random effects) involve directly sampling the latent variables along with the model parameters. In high-level software code for model definitions (using, e.g., BUGS, JAGS, Stan), the likelihood is therefore specified as conditional on the latent variables. This can lead researchers to perform model comparisons via conditional likelihoods, where the latent variables are considered model parameters. In other settings, however, typical model comparisons involve marginal likelihoods where the latent variables are integrated out. This distinction is often overlooked despite the fact that it can have a large impact on the comparisons of interest. In this paper, we clarify and illustrate these issues, focusing on the comparison of conditional and marginal Deviance Information Criteria (DICs) and Watanabe-Akaike Information Criteria (WAICs) in psychometric modeling. The conditional/marginal distinction corresponds to whether the model should be predictive for the clusters that are in the data or for new clusters (where "clusters" typically correspond to higher-level units like people or schools). Correspondingly, we show that marginal WAIC corresponds to leave-one-cluster out cross-validation, whereas conditional WAIC corresponds to leave-one-unit out. These results lead to recommendations on the general application of the criteria to models with latent variables.
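The conditional/marginal distinction can be made concrete with a minimal sketch. It assumes a normal random-intercept model, y_ij ~ N(mu + zeta_j, sigma^2) with zeta_j ~ N(0, tau^2). This model is chosen here only because its marginal likelihood has a closed form; it is not taken from the paper. The function names, the toy data, and the three "posterior draws" are all hypothetical placeholders, not real MCMC output. The conditional version treats each unit as the predicted point, given the sampled zeta_j. The marginal version treats each cluster as the predicted point, with zeta_j integrated out.

```python
import math

def normal_logpdf(y, mean, sd):
    """Log density of a univariate normal distribution."""
    return -0.5 * math.log(2 * math.pi) - math.log(sd) - 0.5 * ((y - mean) / sd) ** 2

def conditional_loglik(y, mu, zeta, sigma):
    """Per-unit log-likelihoods for one cluster, conditional on its latent intercept."""
    return [normal_logpdf(v, mu + zeta, sigma) for v in y]

def marginal_loglik(y, mu, tau, sigma):
    """Log-likelihood of one whole cluster with the intercept integrated out.

    Uses the closed form for the covariance sigma^2 * I + tau^2 * 1 1'.
    """
    n = len(y)
    resid = [v - mu for v in y]
    sum_sq = sum(r * r for r in resid)
    sum_r = sum(resid)
    total_var = sigma ** 2 + n * tau ** 2
    log_det = (n - 1) * math.log(sigma ** 2) + math.log(total_var)
    quad = (sum_sq - (tau ** 2 / total_var) * sum_r ** 2) / sigma ** 2
    return -0.5 * (n * math.log(2 * math.pi) + log_det + quad)

def log_mean_exp(xs):
    """Numerically stable log of the mean of exp(xs)."""
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs) / len(xs))

def waic(loglik):
    """WAIC on the deviance scale from loglik[s][i] (draw s, predicted point i)."""
    lppd = 0.0
    p_waic = 0.0
    for i in range(len(loglik[0])):
        col = [row[i] for row in loglik]
        lppd += log_mean_exp(col)
        mean = sum(col) / len(col)
        # Effective number of parameters: posterior variance of the log-likelihood
        p_waic += sum((c - mean) ** 2 for c in col) / (len(col) - 1)
    return -2.0 * (lppd - p_waic)

# Toy data: two clusters of three units each (made up for illustration).
clusters = [[1.2, 0.8, 1.5], [-0.4, 0.1, -0.9]]

# Hypothetical posterior draws; in practice these would come from an MCMC sampler.
draws = [
    {"mu": 0.3, "tau": 0.9, "sigma": 0.5, "zeta": [0.8, -0.7]},
    {"mu": 0.4, "tau": 1.1, "sigma": 0.6, "zeta": [0.7, -0.9]},
    {"mu": 0.2, "tau": 0.8, "sigma": 0.4, "zeta": [1.0, -0.6]},
]

# Conditional: one column per unit (6 columns), latent variables treated as parameters.
cond = [
    [ll for j, y in enumerate(clusters)
     for ll in conditional_loglik(y, d["mu"], d["zeta"][j], d["sigma"])]
    for d in draws
]

# Marginal: one column per cluster (2 columns), latent variables integrated out.
marg = [
    [marginal_loglik(y, d["mu"], d["tau"], d["sigma"]) for y in clusters]
    for d in draws
]

waic_conditional = waic(cond)
waic_marginal = waic(marg)
```

The only structural difference between the two calls to `waic` is what counts as a column of the log-likelihood matrix: units in the conditional case, clusters in the marginal case. For models without a closed-form marginal likelihood, the integration over the latent variables would have to be done numerically, which this sketch does not cover.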

Keywords: Bayesian information criteria; DIC; IRT; MCMC; SEM; WAIC; conditional likelihood; cross-validation; leave-one-cluster out; marginal likelihood.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bayes Theorem*
  • Cluster Analysis
  • Computer Simulation / standards*
  • Epidemiologic Measurements
  • Humans
  • Latent Class Analysis*
  • Likelihood Functions*
  • Male
  • Markov Chains
  • Monte Carlo Method
  • Predictive Value of Tests
  • Psychometrics
  • Software