A data processing pipeline for mammalian proteome dynamics studies using stable isotope metabolic labeling

Mol Cell Proteomics. 2011 Dec;10(12):M111.010728. doi: 10.1074/mcp.M111.010728. Epub 2011 Sep 21.

Abstract

In a recent study, in vivo metabolic labeling using (15)N traced the rate of label incorporation among more than 1700 proteins simultaneously and enabled the determination of individual protein turnover rate constants over a dynamic range of three orders of magnitude (Price, J. C., Guan, S., Burlingame, A., Prusiner, S. B., and Ghaemmaghami, S. (2010) Analysis of proteome dynamics in the mouse brain. Proc. Natl. Acad. Sci. U. S. A. 107, 14508-14513). These studies of protein dynamics provide a deeper understanding of healthy development and well-being of complex organisms, as well as the possible causes and progression of disease. In addition to a fully labeled food source and appropriate mass spectrometry platform, an essential and enabling component of such large scale investigations is a robust data processing and analysis pipeline, which is capable of the reduction of large sets of liquid chromatography tandem MS raw data files into the desired protein turnover rate constants. The data processing pipeline described in this contribution is comprised of a suite of software modules required for the workflow that fulfills such requirements. This software platform includes established software tools such as a mass spectrometry database search engine together with several additional, novel data processing modules specifically developed for (15)N metabolic labeling. These fulfill the following functions: (1) cross-extraction of (15)N-containing ion intensities from raw data files at varying biosynthetic incorporation times, (2) computation of peptide (15)N isotopic incorporation distributions, and (3) aggregation of relative isotope abundance curves for multiple peptides into single protein curves. In addition, processing parameter optimization and noise reduction procedures were found to be necessary in the processing modules in order to reduce propagation of errors in the long chain of the processing steps of the entire workflow.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Blood Proteins / chemistry
  • Blood Proteins / metabolism
  • Brain / metabolism
  • Chromatography, Liquid
  • Data Interpretation, Statistical*
  • Isotope Labeling
  • Liver / metabolism
  • Mice
  • Molecular Weight
  • Peptide Fragments / chemistry
  • Protein Stability
  • Proteome / chemistry
  • Proteome / metabolism*
  • Software*
  • Tandem Mass Spectrometry

Substances

  • Blood Proteins
  • Peptide Fragments
  • Proteome