Multi-individual microsatellite identification: A multiple genome approach to microsatellite design (MiMi)

Mol Ecol Resour. 2019 Nov;19(6):1672-1680. doi: 10.1111/1755-0998.13065. Epub 2019 Aug 27.

Abstract

Bespoke microsatellite marker panels are increasingly affordable and tractable to researchers and conservationists. The rate of microsatellite discovery is very high within a shotgun genomic data set, but extensive laboratory testing of markers is required for confirmation of amplification and polymorphism. By incorporating shotgun next-generation sequencing data sets from multiple individuals of the same species, we have developed a new method for the optimal design of microsatellite markers. This new tool allows us to increase the rate at which suitable candidate markers are selected by 58% in direct comparisons and facilitate an estimated 16% reduction in costs associated with producing a novel microsatellite panel. Our method enables the visualisation of each microsatellite locus in a multiple sequence alignment allowing several important quality checks to be made. Polymorphic loci can be identified and prioritised. Loci containing fragment-length-altering mutations in the flanking regions, which may invalidate assumptions regarding the model of evolution underlying variation at the microsatellite, can be avoided. Priming regions containing point mutations can be detected and avoided, helping to reduce sample-site-marker specificity arising from genetic isolation, and the likelihood of null alleles occurring. We demonstrate the utility of this new approach in two species: an echinoderm and a bird. Our method makes a valuable contribution towards minimising genotyping errors and reducing costs associated with developing a novel marker panel. The Python script to perform our method of multi-individual microsatellite identification (MiMi) is freely available from GitHub (https://github.com/graemefox/mimi).

Keywords: cost-effective marker development; high-throughput sequencing; in silico quality control; microsatellite design; polymorphic loci detection; short tandem repeat (STR).

MeSH terms

  • Alleles
  • Genetic Markers / genetics
  • Genome / genetics*
  • Genomics / methods
  • Genotype
  • High-Throughput Nucleotide Sequencing / methods
  • Microsatellite Repeats / genetics*
  • Point Mutation / genetics
  • Polymorphism, Genetic / genetics

Substances

  • Genetic Markers