Real time structural search of the Protein Data Bank

PLoS Comput Biol. 2020 Jul 8;16(7):e1007970. doi: 10.1371/journal.pcbi.1007970. eCollection 2020 Jul.

Abstract

Detection of protein structure similarity is a central challenge in structural bioinformatics. Comparisons are usually performed at the polypeptide chain level, however the functional form of a protein within the cell is often an oligomer. This fact, together with recent growth of oligomeric structures in the Protein Data Bank (PDB), demands more efficient approaches to oligomeric assembly alignment/retrieval. Traditional methods use atom level information, which can be complicated by the presence of topological permutations within a polypeptide chain and/or subunit rearrangements. These challenges can be overcome by comparing electron density volumes directly. But, brute force alignment of 3D data is a compute intensive search problem. We developed a 3D Zernike moment normalization procedure to orient electron density volumes and assess similarity with unprecedented speed. Similarity searching with this approach enables real-time retrieval of proteins/protein assemblies resembling a target, from PDB or user input, together with resulting alignments (http://shape.rcsb.org).

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein*
  • Internet
  • Models, Molecular
  • Models, Statistical
  • Normal Distribution
  • Peptides / chemistry
  • Protein Conformation
  • Proteins / chemistry*
  • Software

Substances

  • Peptides
  • Proteins