A Thousand Fly Genomes: An Expanded Drosophila Genome Nexus

Mol Biol Evol. 2016 Dec;33(12):3308-3313. doi: 10.1093/molbev/msw195. Epub 2016 Sep 29.


The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently-published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations.

Keywords: Drosophila melanogaster; data resource.; genomic variation; reference alignment.

Publication types

  • Letter
  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Databases, Nucleic Acid
  • Drosophila melanogaster / genetics*
  • Gene Frequency
  • Genetic Variation
  • Genome, Insect*
  • Reference Standards
  • Selection, Genetic