Multi-Ancestry Polygenic Risk Score for Coronary Heart Disease Based on an Ancestrally Diverse Genome-Wide Association Study and Population-Specific Optimization

Circ Genom Precis Med. 2024 Jun;17(3):e004272. doi: 10.1161/CIRCGEN.123.004272. Epub 2024 Feb 21.

Abstract

Background: Predictive performance of polygenic risk scores (PRS) varies across populations. To facilitate equitable clinical use, we developed PRS for coronary heart disease (CHD; PRSCHD) for 5 genetic ancestry groups.

Methods: We derived ancestry-specific and multi-ancestry PRSCHD based on pruning and thresholding (PRSPT) and ancestry-based continuous shrinkage priors (PRSCSx) applied to summary statistics from the largest multi-ancestry genome-wide association study meta-analysis for CHD to date, including 1.1 million participants from 5 major genetic ancestry groups. Following training and optimization in the Million Veteran Program, we evaluated the best-performing PRSCHD in 176,988 individuals across 9 diverse cohorts.

Results: Multi-ancestry PRSPT and PRSCSx outperformed ancestry-specific PRSPT and PRSCSx across a range of tuning values. Two best-performing multi-ancestry PRSCHD (ie, PRSPTmult and PRSCSxmult) and 1 ancestry-specific (PRSCSxEUR) were taken forward for validation. PRSPTmult demonstrated the strongest association with CHD in individuals of South Asian ancestry and European ancestry (odds ratio per 1 SD [95% CI, 2.75 [2.41-3.14], 1.65 [1.59-1.72]), followed by East Asian ancestry (1.56 [1.50-1.61]), Hispanic/Latino ancestry (1.38 [1.24-1.54]), and African ancestry (1.16 [1.11-1.21]). PRSCSxmult showed the strongest associations in South Asian ancestry (2.67 [2.38-3.00]) and European ancestry (1.65 [1.59-1.71]), lower in East Asian ancestry (1.59 [1.54-1.64]), Hispanic/Latino ancestry (1.51 [1.35-1.69]), and the lowest in African ancestry (1.20 [1.15-1.26]).

Conclusions: The use of summary statistics from a large multi-ancestry genome-wide meta-analysis improved the performance of PRSCHD in most ancestry groups compared with single-ancestry methods. Despite the use of one of the largest and most diverse sets of training and validation cohorts to date, improvement of predictive performance was limited in African ancestry. This highlights the need for larger genome-wide association study datasets of underrepresented populations to enhance the performance of PRSCHD.

Keywords: coronary disease; genetic association studies; genetic predisposition to disease; genetic risk score; genetic variation; odds ratio.

MeSH terms

  • Coronary Disease* / genetics
  • Female
  • Genetic Predisposition to Disease
  • Genetic Risk Score
  • Genome-Wide Association Study*
  • Humans
  • Male
  • Middle Aged
  • Multifactorial Inheritance*
  • Polymorphism, Single Nucleotide
  • Risk Factors