Detecting influential observations in a model-based cluster analysis

Stat Methods Med Res. 2018 Feb;27(2):521-540. doi: 10.1177/0962280216634112. Epub 2016 Mar 17.

Abstract

Finite mixture models have been used to model population heterogeneity and to relax distributional assumptions. These models are also convenient tools for clustering and classification of complex data such as, for example, repeated-measurements data. The performance of model-based clustering algorithms is sensitive to influential and outlying observations. Methods for identifying outliers in a finite mixture model have been described in the literature. Approaches to identify influential observations are less common. In this paper, we apply local-influence diagnostics to a finite mixture model with known number of components. The methodology is illustrated on real-life data.

Keywords: Local influence; finite mixture model; model-based clustering.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Biostatistics
  • Brain / drug effects
  • Brain / physiology
  • Cluster Analysis*
  • Computer Simulation
  • Electroencephalography / statistics & numerical data
  • Humans
  • Likelihood Functions
  • Models, Statistical*
  • Nonlinear Dynamics
  • Pharmacokinetics
  • Psychotropic Drugs / pharmacology
  • Rats

Substances

  • Psychotropic Drugs