Differential Abundance Analysis with Bayes Shrinkage Estimation of Variance (DASEV) for Zero-Inflated Proteomic and Metabolomic Data

Sci Rep. 2020 Jan 21;10(1):876. doi: 10.1038/s41598-020-57470-4.

Abstract

Mass spectrometry (MS) is frequently used for proteomic and metabolomic profiling of biological samples. Data obtained by MS are often zero-inflated. Those zero values are called point mass values (PMVs). Zero values can be further grouped into biological PMVs and technical PMVs. The former type is caused by true absence of a compound and the later type is caused by a technical detection limit. Methods based on a mixture model have been developed to separate the two types of zeros and to perform differential abundance analysis comparing proteomic/metabolomic profiles between different groups of subjects. However, we notice that those methods may give unstable estimate of the model variance, and thus lead to false positive and false negative results when the number of non-zero values is small. In this paper, we propose a new differential abundance analysis method, DASEV, which uses an empirical Bayes shrinkage method to more robustly estimate the variance and enhance the accuracy of differential abundance analysis. Simulation studies and real data analysis show that DASEV substantially improves parameter estimation of the mixture model and outperforms current methods in identifying differentially abundant features.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Analysis of Variance
  • Bayes Theorem
  • Carcinoma, Non-Small-Cell Lung / metabolism
  • Carcinoma, Non-Small-Cell Lung / pathology
  • Databases, Factual
  • Exosomes
  • Humans
  • Lipid Metabolism
  • Lung Neoplasms / metabolism
  • Lung Neoplasms / pathology
  • Mass Spectrometry / statistics & numerical data*
  • Metabolomics / statistics & numerical data
  • Models, Statistical*
  • Proteinuria / metabolism
  • Proteome / metabolism
  • Proteomics / statistics & numerical data

Substances

  • Proteome