Genetic prediction of quantitative lipid traits: comparing shrinkage models to gene scores
- PMID: 24272946
- DOI: 10.1002/gepi.21777
Abstract
Accurate genetic prediction of quantitative traits related to complex disease risk would have potential clinical impact, so investigation of statistical methodology to improve predictive performance is important. We compare a simple approach of polygenic scores using top ranking single nucleotide polymorphisms (SNPs) to a set of shrinkage models, namely Ridge Regression, Lasso and Hyper-Lasso. These penalised regression methods analyse all genotyped SNPs simultaneously, potentially including much larger sets of SNPs in the models, not only those with the smallest P values. We compare the accuracy of these models for predicting low-density lipoprotein (LDL) and high-density lipoprotein (HDL) cholesterol, two lipid traits of clinical relevance, in the Whitehall II and British Women's Health and Heart Study cohorts, using SNPs from the HumanCVD BeadChip. For gene scores, the most accurate predictions arise from multivariate weighted scores and include only a small number of SNPs, identified as top hits by the HumanCVD BeadChip. Furthermore, there was little benefit from including external results from published sets of SNPs. We found that shrinkage approaches rarely improved significantly on gene score results. Genetic predictive performance is trait specific, depending on the heritability and genetic architecture of the trait, and is limited by the training data sample size. Our results for lipid traits suggest no current benefit of more complex methods over existing gene score methods. Instead, the most important choice for the prediction model is the number of SNPs and selection of the most predictive SNPs to include. However, further comparisons, in larger samples and for other phenotypes, would still be of interest.
Keywords: SNP selection; lipids; penalised regression; polygenic score; prediction.
© 2013 WILEY PERIODICALS, INC.
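The contrast drawn in the abstract, between a gene score built from a few top-ranking SNPs and a penalised regression fitted to all SNPs at once, can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the data are simulated, all function names are hypothetical, the gene score uses univariate slopes as weights (the abstract reports that multivariate weights performed best), SNPs are ranked by absolute marginal correlation as a stand-in for P value ranking, and the Lasso is fitted by plain coordinate descent with an arbitrary penalty. Ridge Regression and Hyper-Lasso are not shown.

```python
import random


def soft_threshold(z, lam):
    """Shrink z toward zero by lam; anything inside [-lam, lam] becomes 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


def centre(values):
    m = sum(values) / len(values)
    return [v - m for v in values]


def column(X, j):
    return [row[j] for row in X]


def correlation(a, b):
    a, b = centre(a), centre(b)
    saa, sbb = sum(v * v for v in a), sum(v * v for v in b)
    if saa == 0.0 or sbb == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (saa * sbb) ** 0.5


def fit_gene_score(X, y, k):
    """Keep the k SNPs with the largest |marginal correlation|,
    weighting each by its univariate least-squares slope (an assumption)."""
    p = len(X[0])
    ranked = sorted(range(p), key=lambda j: -abs(correlation(column(X, j), y)))
    yc = centre(y)
    weights = [0.0] * p
    for j in ranked[:k]:
        xc = centre(column(X, j))
        sxx = sum(v * v for v in xc)
        weights[j] = sum(a * b for a, b in zip(xc, yc)) / sxx if sxx > 0 else 0.0
    return weights


def fit_lasso(X, y, lam, n_iter=50):
    """Coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1 on centred data."""
    n, p = len(X), len(X[0])
    cols = [centre(column(X, j)) for j in range(p)]
    resid = centre(y)  # residual when all coefficients are zero
    beta = [0.0] * p
    col_ms = [sum(v * v for v in c) / n for c in cols]
    for _ in range(n_iter):
        for j in range(p):
            if col_ms[j] == 0.0:
                continue  # monomorphic SNP carries no information
            c = cols[j]
            rho = sum(c[i] * resid[i] for i in range(n)) / n + col_ms[j] * beta[j]
            new = soft_threshold(rho, lam) / col_ms[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= c[i] * delta
                beta[j] = new
    return beta


def predict(X, weights):
    return [sum(w * g for w, g in zip(weights, row)) for row in X]


def simulate(n, p, effects, rng):
    """Genotypes are allele counts 0/1/2 at allele frequency 0.3 (simulated)."""
    X = [[sum(rng.random() < 0.3 for _ in range(2)) for _ in range(p)]
         for _ in range(n)]
    y = [sum(b * row[j] for j, b in effects.items()) + rng.gauss(0, 1)
         for row in X]
    return X, y


rng = random.Random(0)
effects = {0: 0.5, 1: -0.4, 2: 0.3}  # hypothetical causal SNPs and effect sizes
X_train, y_train = simulate(300, 20, effects, rng)
X_test, y_test = simulate(300, 20, effects, rng)

score_w = fit_gene_score(X_train, y_train, k=3)
lasso_w = fit_lasso(X_train, y_train, lam=0.05)

# Out-of-sample accuracy as squared correlation between prediction and trait.
r2_score = correlation(predict(X_test, score_w), y_test) ** 2
r2_lasso = correlation(predict(X_test, lasso_w), y_test) ** 2
print("gene score R^2:", round(r2_score, 3))
print("lasso R^2:     ", round(r2_lasso, 3))
```

In this toy setting the two knobs that matter are `k` (how many SNPs enter the gene score) and `lam` (how strongly the Lasso shrinks); in practice both would be chosen by cross-validation within the training data rather than fixed by hand as here.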
Grants and funding
- AG13196/AG/NIA NIH HHS/United States
- PG/09/022/26739/BHF_/British Heart Foundation/United Kingdom
- SP/07/007/23671/BHF_/British Heart Foundation/United Kingdom
- RG/08/008/BHF_/British Heart Foundation/United Kingdom
- G1000718/MRC_/Medical Research Council/United Kingdom
- HS06516/HS/AHRQ HHS/United States
- PG/07/133/24260/BHF_/British Heart Foundation/United Kingdom
- MR/K013351/1/MRC_/Medical Research Council/United Kingdom
- MR/K006215/1/MRC_/Medical Research Council/United Kingdom
- 0090049/DH_/Department of Health/United Kingdom
- G0801414/MRC_/Medical Research Council/United Kingdom
- PG/09/022/BHF_/British Heart Foundation/United Kingdom
- HL36310/HL/NHLBI NIH HHS/United States
