An improved genotyping by sequencing (GBS) approach offering increased versatility and efficiency of SNP discovery and genotyping

PLoS One. 2013;8(1):e54603. doi: 10.1371/journal.pone.0054603. Epub 2013 Jan 23.

Abstract

Highly parallel SNP genotyping platforms have been developed for some important crop species, but these platforms typically carry a high cost per sample for first-time or small-scale users. In contrast, recently developed genotyping by sequencing (GBS) approaches offer a highly cost effective alternative for simultaneous SNP discovery and genotyping. In the present investigation, we have explored the use of GBS in soybean. In addition to developing a novel analysis pipeline to call SNPs and indels from the resulting sequence reads, we have devised a modified library preparation protocol to alter the degree of complexity reduction. We used a set of eight diverse soybean genotypes to conduct a pilot scale test of the protocol and pipeline. Using ApeKI for GBS library preparation and sequencing on an Illumina GAIIx machine, we obtained 5.5 M reads and these were processed using our pipeline. A total of 10,120 high quality SNPs were obtained and the distribution of these SNPs mirrored closely the distribution of gene-rich regions in the soybean genome. A total of 39.5% of the SNPs were present in genic regions and 52.5% of these were located in the coding sequence. Validation of over 400 genotypes at a set of randomly selected SNPs using Sanger sequencing showed a 98% success rate. We then explored the use of selective primers to achieve a greater complexity reduction during GBS library preparation. The number of SNP calls could be increased by almost 40% and their depth of coverage was more than doubled, thus opening the door to an increase in the throughput and a significant decrease in the per sample cost. The approach to obtain high quality SNPs developed here will be helpful for marker assisted genomics as well as assessment of available genetic resources for effective utilisation in a wide number of species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Evolution, Molecular
  • Genome, Plant
  • Genomics
  • Genotype
  • Genotyping Techniques / methods*
  • Genotyping Techniques / standards*
  • Glycine max / classification
  • Glycine max / genetics
  • High-Throughput Nucleotide Sequencing*
  • Phylogeny
  • Polymorphism, Single Nucleotide*
  • Reproducibility of Results

Grants and funding

Funding for this research was provided by the Natural Sciences and Engineering Research Council of Canada and Semences Prograin Inc. (Grant no. RDCPJ 382089–09), Agriculture and AgriFood Canada and the Canadian Field Crop Research Alliance (Grant no. DIAP 5987) and the Canadian International Development Agency (CGIAR-Canada Linkage Fund). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.