Mining the Human Tissue Proteome for Protein Citrullination

Mol Cell Proteomics. 2018 Jul;17(7):1378-1391. doi: 10.1074/mcp.RA118.000696. Epub 2018 Apr 2.

Abstract

Citrullination is a posttranslational modification of arginine catalyzed by five peptidylarginine deiminases (PADs) in humans. The loss of a positive charge may cause structural or functional alterations, and while the modification has been linked to several diseases, including rheumatoid arthritis (RA) and cancer, its physiological or pathophysiological roles remain largely unclear. In part, this is owing to limitations in available methodology to robustly enrich, detect, and localize the modification. As a result, only a few citrullination sites have been identified on human proteins with high confidence. In this study, we mined data from mass-spectrometry-based deep proteomic profiling of 30 human tissues to identify citrullination sites on endogenous proteins. Database searching of ∼70 million tandem mass spectra yielded ∼13,000 candidate spectra, which were further triaged by spectrum quality metrics and the detection of the specific neutral loss of isocyanic acid from citrullinated peptides to reduce false positives. Because citrullination is easily confused with deamidation, we synthetized ∼2,200 citrullinated and 1,300 deamidated peptides to build a library of reference spectra. This led to the validation of 375 citrullination sites on 209 human proteins. Further analysis showed that >80% of the identified modifications sites were new, and for 56% of the proteins, citrullination was detected for the first time. Sequence motif analysis revealed a strong preference for Asp and Gly, residues around the citrullination site. Interestingly, while the modification was detected in 26 human tissues with the highest levels found in the brain and lung, citrullination levels did not correlate well with protein expression of the PAD enzymes. Even though the current work represents the largest survey of protein citrullination to date, the modification was mostly detected on high abundant proteins, arguing that the development of specific enrichment methods would be required in order to study the full extent of cellular protein citrullination.

Keywords: Data evaluation; Omics; Post-translational modifications*; Tandem Mass Spectrometry; Tissues*; citrullination; human proteome; peptidylarginine deiminase; synthetic peptides.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Citrullination*
  • Data Mining*
  • Decision Trees
  • Humans
  • Organ Specificity*
  • Peptides / metabolism
  • Proteome / metabolism*
  • Reproducibility of Results

Substances

  • Peptides
  • Proteome