A data analysis framework for combining multiple batches increases the power of isobaric proteomics experiments

Nat Methods. 2024 Feb;21(2):290-300. doi: 10.1038/s41592-023-02120-6. Epub 2023 Dec 18.

Abstract

We present a framework for the analysis of multiplexed mass spectrometry proteomics data that reduces estimation error when combining multiple isobaric batches. Variations in the number and quality of observations have long complicated the analysis of isobaric proteomics data. Here we show that the power to detect statistical associations is substantially improved by utilizing models that directly account for known sources of variation in the number and quality of observations that occur across batches.In a multibatch benchmarking experiment, our open-source software (msTrawler) increases the power to detect changes, especially in the range of less than twofold changes, while simultaneously increasing quantitative proteome coverage by utilizing more low-signal observations. Further analyses of previously published multiplexed datasets of 4 and 23 batches highlight both increased power and the ability to navigate complex missing data patterns without relying on unverifiable imputations or discarding reliable measurements.

MeSH terms

  • Mass Spectrometry / methods
  • Proteome / analysis
  • Proteomics* / methods
  • Software*

Substances

  • Proteome