File formats commonly used in mass spectrometry proteomics

Mol Cell Proteomics. 2012 Dec;11(12):1612-21. doi: 10.1074/mcp.R112.019695. Epub 2012 Sep 6.

Abstract

The application of mass spectrometry (MS) to the analysis of proteomes has enabled the high-throughput identification and abundance measurement of hundreds to thousands of proteins per experiment. However, the formidable informatics challenge associated with analyzing MS data has required a wide variety of data file formats to encode the complex data types associated with MS workflows. These formats encompass the encoding of input instruction for instruments, output products of the instruments, and several levels of information and results used by and produced by the informatics analysis tools. A brief overview of the most common file formats in use today is presented here, along with a discussion of related topics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computer Storage Devices
  • Computer Systems
  • Databases, Protein
  • Electronic Data Processing*
  • Mass Spectrometry / instrumentation*
  • Proteome / analysis
  • Proteomics / instrumentation*
  • Software

Substances

  • Proteome