PDB file parser and structure class implemented in Python

Bioinformatics. 2003 Nov 22;19(17):2308-10. doi: 10.1093/bioinformatics/btg299.


The biopython project provides a set of bioinformatics tools implemented in Python. Recently, biopython was extended with a set of modules that deal with macromolecular structure. Biopython now contains a parser for PDB files that makes the atomic information available in an easy-to-use but powerful data structure. The parser and data structure deal with features that are often left out or handled inadequately by other packages, e.g. atom and residue disorder (if point mutants are present in the crystal), anisotropic B factors, multiple models and insertion codes. In addition, the parser performs some sanity checking to detect obvious errors.

Availability: The Biopython distribution (including source code and documentation) is freely available (under the Biopython license) from http://www.biopython.org

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Computer Simulation
  • Database Management Systems / standards*
  • Databases, Protein*
  • Information Storage and Retrieval / methods*
  • Information Storage and Retrieval / standards*
  • Macromolecular Substances
  • Models, Molecular*
  • Programming Languages*
  • Protein Conformation
  • Software*


  • Macromolecular Substances