An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem

BMC Bioinformatics. 2005 Feb 14:6:30. doi: 10.1186/1471-2105-6-30.

Abstract

Background: The protein folding problem is a fundamental problems in computational molecular biology and biochemical physics. Various optimisation methods have been applied to formulations of the ab-initio folding problem that are based on reduced models of protein structure, including Monte Carlo methods, Evolutionary Algorithms, Tabu Search and hybrid approaches. In our work, we have introduced an ant colony optimisation (ACO) algorithm to address the non-deterministic polynomial-time hard (NP-hard) combinatorial problem of predicting a protein's conformation from its amino acid sequence under a widely studied, conceptually simple model - the 2-dimensional (2D) and 3-dimensional (3D) hydrophobic-polar (HP) model.

Results: We present an improvement of our previous ACO algorithm for the 2D HP model and its extension to the 3D HP model. We show that this new algorithm, dubbed ACO-HPPFP-3, performs better than previous state-of-the-art algorithms on sequences whose native conformations do not contain structural nuclei (parts of the native fold that predominantly consist of local interactions) at the ends, but rather in the middle of the sequence, and that it generally finds a more diverse set of native conformations.

Conclusions: The application of ACO to this bioinformatics problem compares favourably with specialised, state-of-the-art methods for the 2D and 3D HP protein folding problem; our empirical results indicate that our rather simple ACO algorithm scales worse with sequence length but usually finds a more diverse ensemble of native states. Therefore the development of ACO algorithms for more complex and realistic models of protein structure holds significant promise.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Motifs
  • Amino Acid Sequence
  • Computational Biology / methods*
  • Databases, Factual
  • Evolution, Molecular
  • Models, Biological
  • Models, Chemical
  • Models, Molecular
  • Models, Statistical
  • Models, Theoretical
  • Molecular Conformation
  • Monte Carlo Method
  • Protein Conformation
  • Protein Denaturation
  • Protein Folding
  • Proteins / chemistry*
  • Proteomics / methods*
  • Software
  • Time Factors

Substances

  • Proteins