AIDA: ab initio domain assembly for automated multi-domain protein structure prediction and domain-domain interaction prediction

Bioinformatics. 2015 Jul 1;31(13):2098-105. doi: 10.1093/bioinformatics/btv092. Epub 2015 Feb 19.

Abstract

Motivation: Most proteins consist of multiple domains, independent structural and evolutionary units that are often reshuffled in genomic rearrangements to form new protein architectures. Template-based modeling methods can often detect homologous templates for individual domains, but templates that could be used to model the entire query protein are often not available.

Results: We have developed a fast docking algorithm ab initio domain assembly (AIDA) for assembling multi-domain protein structures, guided by the ab initio folding potential. This approach can be extended to discontinuous domains (i.e. domains with 'inserted' domains). When tested on experimentally solved structures of multi-domain proteins, the relative domain positions were accurately found among top 5000 models in 86% of cases. AIDA server can use domain assignments provided by the user or predict them from the provided sequence. The latter approach is particularly useful for automated protein structure prediction servers. The blind test consisting of 95 CASP10 targets shows that domain boundaries could be successfully determined for 97% of targets.

Availability and implementation: The AIDA package as well as the benchmark sets used here are available for download at http://ffas.burnham.org/AIDA/.

Contact: adam@sanfordburnham.org

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Humans
  • Internet
  • Models, Theoretical*
  • Protein Conformation*
  • Protein Interaction Domains and Motifs
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Sequence Analysis, Protein
  • Software*

Substances

  • Proteins