Multi-source feature learning for joint analysis of incomplete multiple heterogeneous neuroimaging data

Neuroimage. 2012 Jul 2;61(3):622-32. doi: 10.1016/j.neuroimage.2012.03.059. Epub 2012 Mar 29.

Abstract

Analysis of incomplete data is a big challenge when integrating large-scale brain imaging datasets from different imaging modalities. In the Alzheimer's Disease Neuroimaging Initiative (ADNI), for example, over half of the subjects lack cerebrospinal fluid (CSF) measurements; an independent half of the subjects do not have fluorodeoxyglucose positron emission tomography (FDG-PET) scans; many lack proteomics measurements. Traditionally, subjects with missing measures are discarded, resulting in a severe loss of available information. In this paper, we address this problem by proposing an incomplete Multi-Source Feature (iMSF) learning method where all the samples (with at least one available data source) can be used. To illustrate the proposed approach, we classify patients from the ADNI study into groups with Alzheimer's disease (AD), mild cognitive impairment (MCI) and normal controls, based on the multi-modality data. At baseline, ADNI's 780 participants (172AD, 397 MCI, 211 NC), have at least one of four data types: magnetic resonance imaging (MRI), FDG-PET, CSF and proteomics. These data are used to test our algorithm. Depending on the problem being solved, we divide our samples according to the availability of data sources, and we learn shared sets of features with state-of-the-art sparse learning methods. To build a practical and robust system, we construct a classifier ensemble by combining our method with four other methods for missing value estimation. Comprehensive experiments with various parameters show that our proposed iMSF method and the ensemble model yield stable and promising results.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Aged
  • Algorithms
  • Alzheimer Disease / cerebrospinal fluid
  • Alzheimer Disease / pathology
  • Artificial Intelligence*
  • Cognitive Dysfunction / cerebrospinal fluid
  • Cognitive Dysfunction / pathology
  • Databases, Factual
  • Female
  • Fluorodeoxyglucose F18
  • Humans
  • Image Processing, Computer-Assisted / methods*
  • Image Processing, Computer-Assisted / statistics & numerical data
  • Magnetic Resonance Imaging
  • Male
  • Middle Aged
  • Neuroimaging / instrumentation
  • Neuroimaging / methods*
  • Positron-Emission Tomography
  • Proteomics
  • Radiopharmaceuticals

Substances

  • Radiopharmaceuticals
  • Fluorodeoxyglucose F18