A catalog of annotated high-confidence SNPs from exome capture and sequencing reveals highly polymorphic genes in Norway spruce (Picea abies)

BMC Genomics. 2018 Dec 17;19(1):942. doi: 10.1186/s12864-018-5247-z.

Abstract

Background: Norway spruce [Picea abies (L.) Karst.] is ecologically and economically one of the most important conifers worldwide. Our main goal was to develop a large catalog of annotated high-confidence gene SNPs that should sustain the development of genomic tools for the conservation of natural and domesticated genetic diversity resources, and hasten tree breeding efforts in this species.

Results: Targeted sequencing was achieved by capturing the P. abies exome with probes previously designed from the sequenced transcriptome of white spruce [Picea glauca (Moench) Voss]. Capture efficiency was high (74.5%) given a high level of exome conservation between the two species. Using stringent criteria, we delimited a set of 61,771 high-confidence SNPs across 13,543 genes. To validate SNPs, a high-throughput genotyping array was developed for a subset of 5571 predicted SNPs, each representing a different gene locus, and was used to genotype over 1000 trees. The estimated true positive rate of the resource was 84.2%, which was comparable with the genotyping success rate obtained for P. abies control SNPs recycled from previous genotyping efforts. We also analyzed SNP abundance across various gene functional categories. Several GO terms and gene families involved in stress response were found to be over-represented among highly polymorphic genes.
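The over-representation analysis summarized above can be illustrated with a minimal sketch. The abstract does not state which statistical test was used, so the one-sided hypergeometric test below is an assumption, chosen because it is a common way to test GO-term enrichment. Only the SNP and gene totals come from the abstract; the function name and the three category counts are hypothetical and serve purely as an illustration.

```python
from math import comb


def hypergeom_sf(k, M, n, N):
    """Return P(X >= k) when N genes are drawn without replacement
    from M genes, of which n carry the annotation of interest."""
    upper = min(n, N)
    total = comb(M, N)
    tail = sum(comb(n, i) * comb(M - n, N - i) for i in range(k, upper + 1))
    return tail / total


# Figures reported in the abstract.
n_snps = 61_771
n_genes = 13_543
mean_snps_per_gene = n_snps / n_genes  # about 4.56 SNPs per gene

# Hypothetical counts, NOT taken from the study.
genes_with_term = 300        # genes annotated with some stress-response GO term
highly_polymorphic = 1_000   # genes classed as highly polymorphic
observed_overlap = 40        # highly polymorphic genes carrying the term

# Overlap expected by chance if the term were unrelated to polymorphism.
expected_overlap = genes_with_term * highly_polymorphic / n_genes

p_value = hypergeom_sf(observed_overlap, n_genes, genes_with_term,
                       highly_polymorphic)

print(f"mean SNPs per gene: {mean_snps_per_gene:.2f}")
print(f"expected overlap:   {expected_overlap:.1f}")
print(f"enrichment p-value: {p_value:.2e}")
```

In a real analysis the p-values would additionally be corrected for the number of GO terms tested, and the threshold defining a highly polymorphic gene would need to account for gene length and sequencing coverage.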

Conclusion: The annotated high-confidence SNP catalog developed herein is a valuable genomic resource, representing more than 13,000 genes distributed across the P. abies genome. This resource should serve a variety of population genomics and breeding applications in Norway spruce.

Keywords: Annotated gene SNPs; Conifer; Exome sequencing; Illumina Infinium iSelect array; Illumina MiSeq; Picea abies; SNP abundance; Single nucleotide polymorphism.

MeSH terms

  • Contig Mapping
  • DNA, Plant / isolation & purification
  • DNA, Plant / metabolism
  • Exome / genetics*
  • Genotype
  • Molecular Sequence Annotation
  • Picea / genetics*
  • Plant Leaves / genetics
  • Polymorphism, Single Nucleotide*
  • Sequence Analysis, DNA

Substances

  • DNA, Plant