Organization and structural evolution of four multigene families in Arabidopsis thaliana: AtLCAD, AtLGT, AtMYST and AtHD-GL2

Plant Mol Biol. 2000 Mar;42(5):703-17. doi: 10.1023/a:1006368316413.


The Arabidopsis Genome Initiative has so far released more than 80% of the genome sequence of Arabidopsis thaliana. About 70% of the identified genes have at least one paralogue. To understand the biological function of an individual gene, it is therefore essential to study the structure, expression and organization of the entire multigene family to which it belongs. Systematic analysis of multigene families, made possible by the amount of genomic sequence data now available, provides important clues for understanding genome evolution and plasticity. In this paper, four multigene families of A. thaliana are characterized: LCAD, HD-GL2, LGT and MYST. Members of HD-GL2 and LCAD have already been reported in plants. The LGT genes encode proteins containing glycosyltransferase motifs; no plant genes similar to the LGT genes have been reported to date. The novel MYST family, most likely plant-specific, encodes proteins with no identified function. Sequencing and in silico analysis led to the characterization of 29 novel genes belonging to these four gene families. The organization, structure and evolution of all members of the four families are discussed, as well as their chromosomal locations. Expression data for some paralogues of each family are also presented.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alcohol Oxidoreductases / genetics
  • Amino Acid Sequence
  • Arabidopsis / genetics*
  • Arabidopsis Proteins*
  • Chromosome Mapping
  • DNA, Plant / chemistry
  • DNA, Plant / genetics
  • Evolution, Molecular*
  • Exons
  • Gene Expression
  • Gene Expression Regulation, Plant
  • Genes, Plant / genetics*
  • Glycosyltransferases / genetics
  • Homeodomain Proteins / genetics
  • Introns
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Phylogeny
  • Plant Proteins / genetics
  • RNA, Plant / genetics
  • RNA, Plant / metabolism
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Tissue Distribution

Substances

  • Arabidopsis Proteins
  • DNA, Plant
  • GL2 protein, Arabidopsis
  • Homeodomain Proteins
  • Plant Proteins
  • RNA, Plant
  • Alcohol Oxidoreductases
  • cinnamyl alcohol dehydrogenase
  • Glycosyltransferases

Associated data

  • GENBANK/AJ224338
  • GENBANK/AJ243015
  • GENBANK/Y16848