Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features

Proc Natl Acad Sci U S A. 2006 Aug 1;103(31):11647-52. doi: 10.1073/pnas.0604795103. Epub 2006 Jul 25.


The green lineage is reportedly 1,500 million years old, evolving shortly after the endosymbiosis event that gave rise to early photosynthetic eukaryotes. In this study, we unveil the complete genome sequence of an ancient member of this lineage, the unicellular green alga Ostreococcus tauri (Prasinophyceae). This cosmopolitan marine primary producer is the world's smallest free-living eukaryote known to date. Features likely reflecting optimization of environmentally relevant pathways, including resource acquisition, unusual photosynthesis apparatus, and genes potentially involved in C(4) photosynthesis, were observed, as was downsizing of many gene families. Overall, the 12.56-Mb nuclear genome has an extremely high gene density, in part because of extensive reduction of intergenic regions and other forms of compaction such as gene fusion. However, the genome is structurally complex. It exhibits previously unobserved levels of heterogeneity for a eukaryote. Two chromosomes differ structurally from the other eighteen. Both have a significantly biased G+C content, and, remarkably, they contain the majority of transposable elements. Many chromosome 2 genes also have unique codon usage and splicing, but phylogenetic analysis and composition do not support alien gene origin. In contrast, most chromosome 19 genes show no similarity to green lineage genes and a large number of them are specialized in cell surface processes. Taken together, the complete genome sequence, unusual features, and downsized gene families, make O. tauri an ideal model system for research on eukaryotic genome evolution, including chromosome specialization and green lineage ancestry.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chlorophyta / genetics*
  • Chromosomes
  • Eukaryotic Cells*
  • Evolution, Molecular
  • Genome*
  • Molecular Sequence Data
  • Sequence Analysis, DNA

Associated data

  • GENBANK/CR954201
  • GENBANK/CR954202
  • GENBANK/CR954203
  • GENBANK/CR954204
  • GENBANK/CR954205
  • GENBANK/CR954206
  • GENBANK/CR954207
  • GENBANK/CR954208
  • GENBANK/CR954209
  • GENBANK/CR954210
  • GENBANK/CR954211
  • GENBANK/CR954212
  • GENBANK/CR954213
  • GENBANK/CR954214
  • GENBANK/CR954215
  • GENBANK/CR954216
  • GENBANK/CR954217
  • GENBANK/CR954218
  • GENBANK/CR954219
  • GENBANK/CR954220