Tandem duplication of the FLC locus and the origin of a new gene in Arabidopsis related species and their functional implications in allopolyploids

New Phytol. 2010 Apr;186(1):228-38. doi: 10.1111/j.1469-8137.2009.03164.x. Epub 2010 Jan 22.

Abstract

Flowering time is an important adaptive trait and varies among Arabidopsis thaliana and its related species, including allopolyploids that are formed between A. thaliana and Arabidopsis arenosa. FLOWERING LOCUS C (FLC) inhibits early flowering in A. thaliana. A previous study showed that late-flowering A. arenosa contained two or more FLC alleles that were differentially expressed in Arabidopsis allotetraploids, but the genomic organization and evolution of FLC locus were unknown. Comparative sequence and evolutionary analyses were performed in FLC-containing genomic regions in A. thaliana, A. arenosa and Arabidopsis lyrata, and expression of FLC loci and alleles was examined in Arabidopsis allopolyploids. The FLC locus was tandemly duplicated in A. lyrata and triplicated in A. arenosa, and the tandem duplication event occurred after divergence from A. thaliana. Although FLC duplicates were highly conserved, their upstream sequences rapidly diverged. The third FLC copy in A. arenosa acquired a new splicing site through a point mutation in the intron and generated the new exon followed by an early stop codon, resulting in a novel MADS box gene. Flowering time variation in Arabidopsis allopolyploids is probably related to the expression diversity and/or copy number of multiple FLC loci. Moreover, exonization of intronic sequence is a mechanism for the origin of new genes.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis / genetics*
  • Arabidopsis Proteins / genetics*
  • Evolution, Molecular
  • Gene Duplication*
  • Genes, Plant / genetics*
  • Genetic Loci / genetics*
  • Genome, Plant / genetics
  • MADS Domain Proteins / genetics
  • Models, Genetic
  • Polyploidy*
  • Regulatory Sequences, Nucleic Acid
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid
  • Synteny / genetics

Substances

  • Arabidopsis Proteins
  • MADS Domain Proteins