The impact of population structure on genomic prediction in stratified populations

Theor Appl Genet. 2014 Mar;127(3):749-62. doi: 10.1007/s00122-013-2255-x. Epub 2014 Jan 24.

Abstract

Impacts of population structure on the evaluation of genomic heritability and prediction were investigated and quantified using high-density markers in diverse panels in rice and maize. Population structure is an important factor affecting estimation of genomic heritability and assessment of genomic prediction in stratified populations. In this study, our first objective was to assess effects of population structure on estimations of genomic heritability using the diversity panels in rice and maize. Results indicate population structure explained 33 and 7.5% of genomic heritability for rice and maize, respectively, depending on traits, with the remaining heritability explained by within-subpopulation variation. Estimates of within-subpopulation heritability were higher than that derived from quantitative trait loci identified in genome-wide association studies, suggesting 65% improvement in genetic gains. The second objective was to evaluate effects of population structure on genomic prediction using cross-validation experiments. When population structure exists in both training and validation sets, correcting for population structure led to a significant decrease in accuracy with genomic prediction. In contrast, when prediction was limited to a specific subpopulation, population structure showed little effect on accuracy and within-subpopulation genetic variance dominated predictions. Finally, effects of genomic heritability on genomic prediction were investigated. Accuracies with genomic prediction increased with genomic heritability in both training and validation sets, with the former showing a slightly greater impact. In summary, our results suggest that the population structure contribution to genomic prediction varies based on prediction strategies, and is also affected by the genetic architectures of traits and populations. In practical breeding, these conclusions may be helpful to better understand and utilize the different genetic resources in genomic prediction.

MeSH terms

  • Genetic Association Studies / methods*
  • Genetic Markers
  • Genome, Plant*
  • Genomics / methods*
  • Models, Genetic
  • Oryza / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci
  • Selection, Genetic
  • Zea mays / genetics

Substances

  • Genetic Markers