Analysis of significant protein abundance from multiple reaction-monitoring data

BMC Syst Biol. 2018 Dec 31;12(Suppl 9):123. doi: 10.1186/s12918-018-0656-9.

Abstract

Background: Discovering reliable protein biomarkers is one of the most important issues in biomedical research. The ELISA is a traditional technique for accurate quantitation of well-known proteins. Recently, the multiple reaction-monitoring (MRM) mass spectrometry has been proposed for quantifying newly discovered protein and has become a popular alternative to ELISA. For the MRM data analysis, linear mixed modeling (LMM) has been used to analyze MRM data. MSstats is one of the most widely used tools for MRM data analysis that is based on the LMMs. However, LMMs often provide various significance results, depending on model specification. Sometimes it would be difficult to specify a correct LMM method for the analysis of MRM data. Here, we propose a new logistic regression-based method for Significance Analysis of Multiple Reaction Monitoring (LR-SAM).

Results: Through simulation studies, we demonstrate that LMM methods may not preserve type I error, thus yielding high false- positive errors, depending on how random effects are specified. Our simulation study also shows that the LR-SAM approach performs similarly well as LMM approaches, in most cases. However, LR-SAM performs better than the LMMs, particularly when the effects sizes of peptides from the same protein are heterogeneous. Our proposed method was applied to MRM data for identification of proteins associated with clinical responses of treatment of 115 hepatocellular carcinoma (HCC) patients with the tyrosine kinase inhibitor sorafenib. Of 124 candidate proteins, LMM approaches provided 6 results varying in significance, while LR-SAM, by contrast, yielded 18 significant results that were quite reproducibly consistent.

Conclusion: As exemplified by an application to HCC data set, LR-SAM more effectively identified proteins associated with clinical responses of treatment than LMM did.

Keywords: Hepatocellular carcinoma (HCC); Logistic regression-based method for significance analysis of multiple reaction monitoring (LR-SAM); Multiple reaction-monitoring (MRM); Protein; Sorafenib response.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers / metabolism
  • Biomarkers, Tumor / blood
  • Biomarkers, Tumor / metabolism
  • Carcinoma, Hepatocellular / blood
  • Carcinoma, Hepatocellular / metabolism
  • Computational Biology / methods*
  • Humans
  • Liver Neoplasms / blood
  • Liver Neoplasms / metabolism
  • Protein Interaction Mapping / methods*

Substances

  • Biomarkers
  • Biomarkers, Tumor