Genome-scale discovery and characterization of class-specific protein sequences: an example using the protein phosphatases of Arabidopsis thaliana

Methods Mol Biol. 2007;365:347-70. doi: 10.1385/1-59745-267-X:347.


The increasing pace of acquisition of fully sequenced genomes makes desirable a program of discovery and characterization of protein sequences of biologically significant structural classes. An example is protein phosphatases, involved in modulating reversible protein phosphorylation events underlying the whole gamut of cellular biology. The ready availability of software that can be downloaded to run on a personal computer, or accessed on a server via the Web, allows appropriate sequences to be collected and analyzed. A process is outlined here that has been successfully employed in the description of the genomic complement of protein phosphatase catalytic subunits from the model plant Arabidopsis thaliana. However, the methods are general and readily adapted to deal with any desired class of protein, from any organism of interest.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Arabidopsis Proteins / chemistry
  • Arabidopsis Proteins / genetics
  • Arabidopsis Proteins / metabolism*
  • Evolution, Molecular
  • Genome, Plant*
  • Genomics / methods
  • Molecular Sequence Data
  • Phosphoprotein Phosphatases / chemistry
  • Phosphoprotein Phosphatases / genetics
  • Phosphoprotein Phosphatases / metabolism*
  • Phosphorylation
  • Phylogeny
  • Sequence Analysis, Protein
  • Sequence Homology, Amino Acid


  • Arabidopsis Proteins
  • Phosphoprotein Phosphatases