Domains, motifs and clusters in the protein universe

Jinfeng Liu; Burkhard Rost

doi:10.1016/s1367-5931(02)00003-0

Domains, motifs and clusters in the protein universe

Curr Opin Chem Biol. 2003 Feb;7(1):5-11. doi: 10.1016/s1367-5931(02)00003-0.

Authors

Jinfeng Liu¹, Burkhard Rost

Affiliation

¹ CUBIC and North East Structural Genomics Consortium, Department of Biochemistry and Molecular Biophysics, Columbia University, 650 West 168th Street BB217, New York, NY 10032, USA.

PMID: 12547420
DOI: 10.1016/s1367-5931(02)00003-0

Abstract

The rapid growth of bio-sequence information has resulted in an increasing demand for reliable methods that group proteins. A few databases with curated alignments of protein families have demonstrated that expert-driven repositories can keep up with the data deluge in the genome era. These original resources implicitly identify domain-like modules in proteins. An increasing number of automatic methods have sprouted over the past few years that cluster the protein universe. Many of these implicitly dissect proteins into structural domain-like fragments. In a very coarse-grained evaluation, some of the automatic methods appear to be on par with expert-driven approaches. However, neither automatic nor manual methods are currently entirely up to the challenges of tasks such as target selection in structural genomics. Thus, we urgently need refined and sustained automatic clustering tools.

Publication types

Research Support, U.S. Gov't, P.H.S.
Review

MeSH terms

Amino Acid Motifs
Cluster Analysis
Databases, Protein*
Expert Systems
Protein Structure, Tertiary
Proteins / classification*
Sequence Homology, Amino Acid
Structural Homology, Protein

Substances

Proteins

Abstract

Publication types

MeSH terms

Substances

Grants and funding