Analysis of the Saccharomyces cerevisiae pan-genome reveals a pool of copy number variants distributed in diverse yeast strains from differing industrial environments

Genome Res. 2012 May;22(5):908-24. doi: 10.1101/gr.130310.111. Epub 2012 Feb 27.

Abstract

Although the budding yeast Saccharomyces cerevisiae is arguably one of the most well-studied organisms on earth, the genome-wide variation within this species--i.e., its "pan-genome"--has been less explored. We created a multispecies microarray platform containing probes covering the genomes of several Saccharomyces species: S. cerevisiae, including regions not found in the standard laboratory S288c strain, as well as the mitochondrial and 2-μm circle genomes-plus S. paradoxus, S. mikatae, S. kudriavzevii, S. uvarum, S. kluyveri, and S. castellii. We performed array-Comparative Genomic Hybridization (aCGH) on 83 different S. cerevisiae strains collected across a wide range of habitats; of these, 69 were commercial wine strains, while the remaining 14 were from a diverse set of other industrial and natural environments. We observed interspecific hybridization events, introgression events, and pervasive copy number variation (CNV) in all but a few of the strains. These CNVs were distributed throughout the strains such that they did not produce any clear phylogeny, suggesting extensive mating in both industrial and wild strains. To validate our results and to determine whether apparently similar introgressions and CNVs were identical by descent or recurrent, we also performed whole-genome sequencing on nine of these strains. These data may help pinpoint genomic regions involved in adaptation to different industrial milieus, as well as shed light on the course of domestication of S. cerevisiae.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Cluster Analysis
  • Comparative Genomic Hybridization
  • DNA Copy Number Variations*
  • DNA Transposable Elements / genetics
  • Genetic Variation
  • Genome, Fungal*
  • Genome, Mitochondrial
  • Hybridization, Genetic
  • Plasmids / genetics
  • Principal Component Analysis
  • Recombination, Genetic
  • Saccharomyces cerevisiae / genetics*
  • Sequence Analysis, DNA
  • Telomere / genetics

Substances

  • DNA Transposable Elements

Associated data

  • GEO/GSE26689