Bayesian Case Influence Measures for Statistical Models with Missing Data

J Comput Graph Stat. 2012;21(1):253-271. doi: 10.1198/jcgs.2011.10139. Epub 2010 Aug 1.

Abstract

We examine three Bayesian case influence measures including the φ-divergence, Cook's posterior mode distance and Cook's posterior mean distance for identifying a set of influential observations for a variety of statistical models with missing data including models for longitudinal data and latent variable models in the absence/presence of missing data. Since it can be computationally prohibitive to compute these Bayesian case influence measures in models with missing data, we derive simple first-order approximations to the three Bayesian case influence measures by using the Laplace approximation formula and examine the applications of these approximations to the identification of influential sets. All of the computations for the first-order approximations can be easily done using Markov chain Monte Carlo samples from the posterior distribution based on the full data. Simulated data and an AIDS dataset are analyzed to illustrate the methodology.

Keywords: Case influence measures; Cook distance; First-order approximation; Markov chain Monte Carlo; ϕ-divergence.

Grant support