CDART: protein homology by domain architecture

Genome Res. 2002 Oct;12(10):1619-23. doi: 10.1101/gr.278202.


The Conserved Domain Architecture Retrieval Tool (CDART) performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins. The algorithm finds protein similarities across significant evolutionary distances using sensitive protein domain profiles rather than by direct sequence similarity. Proteins similar to a query protein are grouped and scored by architecture. Relying on domain profiles allows CDART to be fast, and, because it relies on annotated functional domains, informative. Domain profiles are derived from several collections of domain definitions that include functional annotation. Searches can be further refined by taxonomy and by selecting domains of interest. CDART is available at

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • BRCA1 Protein / chemistry
  • BRCA1 Protein / classification
  • Computational Biology / methods
  • Conserved Sequence*
  • Databases, Protein
  • Humans
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Sequence Homology, Amino Acid*
  • Software*


  • BRCA1 Protein
  • Proteins