A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard

Biometrics. 2004 Jun;60(2):427-35. doi: 10.1111/j.0006-341X.2004.00187.x.


Modeling diagnostic error without a gold standard has been an active area of biostatistical research. In a majority of the approaches, model-based estimates of sensitivity, specificity, and prevalence are derived from a latent class model in which the latent variable represents an individual's true unobserved disease status. For simplicity, initial approaches assumed that the diagnostic test results on the same subject were independent given the true disease status (i.e., the conditional independence assumption). More recently, various authors have proposed approaches for modeling the dependence structure between test results given true disease status. This note discusses a potential problem with these approaches. Namely, we show that when the conditional dependence between tests is misspecified, estimators of sensitivity, specificity, and prevalence can be biased. Importantly, we demonstrate that with small numbers of tests, likelihood comparisons and other model diagnostics may not be able to distinguish between models with different dependence structures. We present asymptotic results that show the generality of the problem. Further, data analysis and simulations demonstrate the practical implications of model misspecification. Finally, we present some guidelines about the use of these models for practitioners.

MeSH terms

  • Biometry*
  • Dental Caries / diagnosis
  • Dental Caries / diagnostic imaging
  • Diagnostic Errors / statistics & numerical data*
  • Humans
  • Likelihood Functions
  • Models, Statistical*
  • Radiography
  • Sensitivity and Specificity