Development of a core SNP panel for cacao (Theobroma cacao L.) identity analysis

Genome. 2020 Feb;63(2):103-114. doi: 10.1139/gen-2019-0071. Epub 2019 Nov 4.

Abstract

Single nucleotide polymorphisms (SNPs) are preferred markers for DNA fingerprinting and diversity studies in cacao (Theobroma cacao L.). Yet a consensus SNP panel containing the minimum number of SNPs needed for optimal identity analysis is unavailable for cacao. An initial set of 146 SNP panels of varying sizes was assembled based on heterozygosity, linkage disequilibrium (LD), linkage group (LG) distribution, major allele frequency, minor allele frequency (MiAF), polymorphism information content (PIC), and random distribution. These panels were assessed for their ability to distinguish among a training set of 155 accessions. The panels with the best separation ability were supplemented with additional SNPs to create 16 designer panels, which separated all 155 accessions. The 16 designer SNP panels were then assessed on a dataset of 1220 accessions from 10 ancestral groups. Increasing the number of SNPs generally improved the resolution of genetic identities, with a concomitant reduction in synonymous groups. The number and choice of SNPs were critical factors: LD, MiAF, and PIC were important selection attributes, whereas an even LG distribution was unnecessary. A robust set of 96 SNPs is recommended to the international cacao community as a minimal core SNP panel for cacao DNA fingerprinting.
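
As an illustration of two selection attributes named in the abstract (MiAF and PIC) and of how synonymous groups shrink as SNPs are added, the following is a minimal sketch and not the authors' pipeline. It assumes biallelic SNPs in diploid accessions, genotypes coded as 0/1/2 copies of one allele, and no missing data. PIC uses the standard Botstein et al. (1980) formula reduced to two alleles. The function names and the toy accessions A to D are hypothetical.

```python
from collections import defaultdict


def minor_allele_frequency(genotypes):
    """MiAF for one biallelic SNP; genotypes are 0/1/2 allele counts (diploid)."""
    freq = sum(genotypes) / (2 * len(genotypes))
    return min(freq, 1 - freq)


def pic_biallelic(maf):
    """Polymorphism information content for two alleles with frequencies p and q."""
    p, q = maf, 1 - maf
    return 1 - (p * p + q * q) - 2 * p * p * q * q


def synonymous_groups(matrix, panel):
    """Group accessions whose genotypes are identical at every SNP in the panel.

    matrix: dict mapping accession name -> list of genotype codes
    panel:  list of SNP column indices making up the panel
    Returns only groups with more than one member.
    """
    groups = defaultdict(list)
    for name, calls in matrix.items():
        groups[tuple(calls[i] for i in panel)].append(name)
    return [sorted(members) for members in groups.values() if len(members) > 1]


# Hypothetical toy data: four accessions genotyped at four SNPs.
toy = {
    "A": [0, 1, 2, 0],
    "B": [0, 1, 2, 1],
    "C": [0, 1, 0, 2],
    "D": [0, 1, 2, 0],
}

if __name__ == "__main__":
    # Larger panels split more accessions apart; A and D stay together
    # because they are identical at all four SNPs.
    for panel in ([0, 1], [0, 1, 2], [0, 1, 2, 3]):
        print(len(panel), "SNPs ->", synonymous_groups(toy, panel))
    column = [calls[3] for calls in toy.values()]
    maf = minor_allele_frequency(column)
    print("MiAF:", maf, "PIC:", pic_biallelic(maf))
```

In this toy case the two-SNP panel leaves all four accessions in one synonymous group, the three-SNP panel separates C, and the four-SNP panel leaves only A and D together. This mirrors the general trend reported above. A real analysis would also have to handle missing calls and genotyping error, which this sketch ignores.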

Keywords: DNA fingerprinting; SNP panel; Theobroma cacao accessions; genetic diversity; identity analysis.

MeSH terms

  • Cacao / genetics*
  • DNA Fingerprinting*
  • Gene Frequency
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide*