Proteome Data Improves Protein Function Prediction in the Interactome of Helicobacter pylori

Mol Cell Proteomics. 2018 May;17(5):961-973. doi: 10.1074/mcp.RA117.000474. Epub 2018 Feb 1.


Helicobacter pylori is a common pathogen that is estimated to infect half of the human population, causing several diseases such as duodenal ulcer. Despite one of the first pathogens to be sequenced, its proteome remains poorly characterized as about one-third of its proteins have no functional annotation. Here, we integrate and analyze known protein interactions with proteomic and genomic data from different sources. We find that proteins with similar abundances tend to interact. Such an observation is accompanied by a trend of interactions to appear between proteins of similar functions, although some show marked cross-talk to others. Protein function prediction with protein interactions is significantly improved when interactions from other bacteria are included in our network, allowing us to obtain putative functions of more than 300 poorly or previously uncharacterized proteins. Proteins that are critical for the topological controllability of the underlying network are significantly enriched with genes that are up-regulated in the spiral compared with the coccoid form of H. pylori Determining their evolutionary conservation, we present evidence that 80 protein complexes are identical in composition with their counterparts in Escherichia coli, while 85 are partially conserved and 120 complexes are completely absent. Furthermore, we determine network clusters that coincide with related functions, gene essentiality, genetic context, cellular localization, and gene expression in different cellular states.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Bacterial Proteins / metabolism*
  • Gene Expression Regulation
  • Genome, Bacterial
  • Helicobacter pylori / genetics
  • Helicobacter pylori / metabolism*
  • Models, Molecular
  • Multiprotein Complexes / metabolism
  • Operon / genetics
  • Phenotype
  • Protein Interaction Maps*
  • Proteome / metabolism*
  • Proteomics / methods*


  • Bacterial Proteins
  • Multiprotein Complexes
  • Proteome

Associated data

  • PDB/1A50
  • PDB/1PII
  • PDB/1KGZ
  • PDB/1I1Q
  • PDB/2EEY
  • PDB/2FUW
  • PDB/3RPF
  • PDB/2BZ0