Safeguarding Structural Data Repositories against Bad Apples

Structure. 2016 Feb 2;24(2):216-20. doi: 10.1016/j.str.2015.12.010.


Structural biology research generates large amounts of data, some deposited in public databases or repositories, but a substantial remainder never becomes available to the scientific community. In addition, some of the deposited data contain less or more serious errors that may bias the results of data mining. Thorough analysis and discussion of these problems is needed to ameliorate this situation. This perspective is an attempt to propose some solutions and encourage both further discussion and action on the part of the relevant organizations, in particular the PDB and various bodies of the International Union of Crystallography.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Computational Biology / methods*
  • Data Accuracy
  • Data Mining
  • Databases, Protein / standards*
  • Models, Molecular
  • Protein Conformation
  • Proteins / chemistry*


  • Proteins