Protein folds in the worm genome

Pac Symp Biocomput. 2000:30-41. doi: 10.1142/9789814447331_0004.

Abstract

We survey the protein folds in the worm genome, using pairwise and multiple-sequence comparison methods (i.e. FASTA and PSI-blast). Overall, we find that approximately 250 folds match approximately 8000 domains in approximately 4500 ORFs, about 32 matches per fold involving a quarter of the total worm ORFs. We compare the folds in the worm genome to those in other model organisms, in particular yeast and E. coli, and find that the worm shares more folds with the phylogenetically closer yeast than with E. coli. There appear to be 36 folds unique to the worm compared to these two model organisms, and many of these are obviously implicated in aspects of multicellularity. The most common fold in the worm genome is the immunoglobulin fold, and many of the common folds are repeated in various combinations and permutations in multidomain proteins. In addition, an approach is presented for the identification of "sure" and "marginal" membrane proteins. When applied to the worm genome, this reveals a much greater relative prevalence of proteins with seven transmembrane helices in comparison to the other completely sequenced genomes, which are not of metazoans. Combining these analyses with some other simple filters allows one to identify ORFs that potentially code for soluble proteins of unknown fold, which may be promising targets for experimental investigation in structural genomics. A regularly updated worm fold analysis will be available from bioinfo.mbb.yale.edu/genome/worm.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Caenorhabditis elegans / genetics*
  • Computer Simulation
  • Escherichia coli / genetics
  • Fungal Proteins / chemistry
  • Fungal Proteins / genetics
  • Genome*
  • Helminth Proteins / chemistry*
  • Helminth Proteins / genetics*
  • Protein Folding
  • Saccharomyces cerevisiae / genetics
  • Sequence Alignment
  • Species Specificity

Substances

  • Bacterial Proteins
  • Fungal Proteins
  • Helminth Proteins