Identification, structure, and differential expression of members of a BURP domain containing protein family in soybean

Genome. 2002 Aug;45(4):693-701. doi: 10.1139/g02-032.

Abstract

Expressed sequence tags (ESTs) exhibiting homology to a BURP domain containing gene family were identified from the Glycine max (L.) Merr. EST database. These ESTs were assembled into 16 contigs of variable sizes and lengths. Consistent with the structure of known BURP domain containing proteins, the translation products exhibit a modular structure consisting of a C-terminal BURP domain, an N-terminal signal sequence, and a variable internal region. The soybean family members exhibit 35-98% similarity in a -100-amino-acid C-terminal region, and a phylogenetic tree constructed using this region shows that some soybean family members group together in closely related pairs, triplets, and quartets, whereas others remain as singletons. The structure of these groups suggests that multiple gene duplication events occurred during the evolutionary history of this family. The depth and diversity of G. max EST libraries allowed tissue-specific expression patterns of the putative soybean BURPs to be examined. Consistent with known BURP proteins, the newly identified soybean BURPs have diverse expression patterns. Furthermore, putative paralogs can have both spatially and quantitatively distinct expression patterns. We discuss the functional and evolutionary implications of these findings, as well as the utility of EST-based analyses for identifying and characterizing gene families.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Evolution, Molecular
  • Expressed Sequence Tags*
  • Gene Expression Profiling
  • Glycine max / genetics*
  • Molecular Sequence Data
  • Plant Proteins / genetics*
  • Protein Structure, Tertiary

Substances

  • Plant Proteins