Divergent evolutionary and expression patterns between lineage specific new duplicate genes and their parental paralogs in Arabidopsis thaliana

PLoS One. 2013 Aug 29;8(8):e72362. doi: 10.1371/journal.pone.0072362. eCollection 2013.

Abstract

Gene duplication is an important mechanism for the origination of functional novelties in organisms. We performed a comparative genome analysis to systematically estimate recent lineage specific gene duplication events in Arabidopsis thaliana and further investigate whether and how these new duplicate genes (NDGs) play a functional role in the evolution and adaption of A. thaliana. We accomplished this using syntenic relationship among four closely related species, A. thaliana, A. lyrata, Capsella rubella and Brassica rapa. We identified 100 NDGs, showing clear origination patterns, whose parental genes are located in syntenic regions and/or have clear orthologs in at least one of three outgroup species. All 100 NDGs were transcribed and under functional constraints, while 24% of the NDGs have differential expression patterns compared to their parental genes. We explored the underlying evolutionary forces of these paralogous pairs through conducting neutrality tests with sequence divergence and polymorphism data. Evolution of about 15% of NDGs appeared to be driven by natural selection. Moreover, we found that 3 NDGs not only altered their expression patterns when compared with parental genes, but also evolved under positive selection. We investigated the underlying mechanisms driving the differential expression of NDGs and their parents, and found a number of NDGs had different cis-elements and methylation patterns from their parental genes. Overall, we demonstrated that NDGs acquired divergent cis-elements and methylation patterns and may experience sub-functionalization or neo-functionalization influencing the evolution and adaption of A. thaliana.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / classification
  • Arabidopsis / genetics*
  • Arabidopsis Proteins / genetics
  • DNA Methylation
  • Evolution, Molecular*
  • Gene Duplication*
  • Gene Expression Regulation, Plant*
  • Genes, Duplicate*
  • Genetic Variation
  • Genetics, Population
  • Genome, Plant
  • Nucleotide Motifs
  • Phylogeny
  • Regulatory Sequences, Nucleic Acid
  • Species Specificity
  • Transcriptome

Substances

  • Arabidopsis Proteins

Grants and funding

This work was supported by start-up fund from Wayne State University to CF. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.