Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 26;17(8):e1009723.
doi: 10.1371/journal.pgen.1009723. eCollection 2021 Aug.

The impact of age on genetic risk for common diseases

Affiliations

The impact of age on genetic risk for common diseases

Xilin Jiang et al. PLoS Genet. .

Abstract

Inherited genetic variation contributes to individual risk for many complex diseases and is increasingly being used for predictive patient stratification. Previous work has shown that genetic factors are not equally relevant to human traits across age and other contexts, though the reasons for such variation are not clear. Here, we introduce methods to infer the form of the longitudinal relationship between genetic relative risk for disease and age and to test whether all genetic risk factors behave similarly. We use a proportional hazards model within an interval-based censoring methodology to estimate age-varying individual variant contributions to genetic relative risk for 24 common diseases within the British ancestry subset of UK Biobank, applying a Bayesian clustering approach to group variants by their relative risk profile over age and permutation tests for age dependency and multiplicity of profiles. We find evidence for age-varying relative risk profiles in nine diseases, including hypertension, skin cancer, atherosclerotic heart disease, hypothyroidism and calculus of gallbladder, several of which show evidence, albeit weak, for multiple distinct profiles of genetic relative risk. The predominant pattern shows genetic risk factors having the greatest relative impact on risk of early disease, with a monotonic decrease over time, at least for the majority of variants, although the magnitude and form of the decrease varies among diseases. As a consequence, for diseases where genetic relative risk decreases over age, genetic risk factors have stronger explanatory power among younger populations, compared to older ones. We show that these patterns cannot be explained by a simple model involving the presence of unobserved covariates such as environmental factors. We discuss possible models that can explain our observations and the implications for genetic risk prediction.

PubMed Disclaimer

Conflict of interest statement

G.M. is a director of and shareholder in Genomics PLC, and is a partner in Peptide Groove LLP. The other authors declare no competing financial interests.

Figures

Fig 1
Fig 1. Schematic representation of methodology.
(A) Independent variants associated with a trait of interest are identified by analysis of the entire UK Biobank cohort using the TreeWAS methodology [40]. (B) Logistic regression is applied to estimate coefficients for variants on each trait using the training set. (C) Coefficients are used to compute individual genetic risk scores; the odds ratio associated with high GRS within each age group are estimated in the testing set. (D) An interval-censored proportional hazards model [44] is used to estimate the effect (and associated standard error) of each variant on the trait of interest within each of eight age intervals. (E) Bayesian clustering is used to estimate population age-profiles of risk, using either linear models or quadratic polynomials to encourage smoothness. (F-H) Permutation is used to test for age-homogeneity of effect size as well as to assess the evidence for multiple age profiles.
Fig 2
Fig 2. Age-stratified odds-ratios for combined genetic risk scores.
(A-F) Age-stratified odds ratios in held-out testing data for genetic risk scores for six disorders where there is evidence for a single non-constant genetic risk profile, “Primary (essential) hypertension” (ICD-10 code I10), “pure hypercholesterolaemia” (E78.0); “Calculus of gallbladder without cholecystitis” (K80.2) and “Hypothyroidism, unspecified” (E03.9); “atherosclerotic heart disease of native coronary artery” (I25.1) and “other and unspecified malignant neoplasm of skin and unspecified parts of face” (C44.3). Results for all diseases are shown in S1 Fig. Odds ratios for the 80th (blue) and 90th percentiles of a combined genetic risk score within matched case-control samples (four controls for each case) are shown for each age interval; points indicate the average odds ratio of twenty five-fold cross-validation analyses with lines indicating the 95% confidence interval.
Fig 3
Fig 3. Overview of simulation results.
(A) Power at P ≤ 0.05 to detect deviation from age-homogeneity as a function of slope in a model where effect sizes change linearly with age. The blue line indicates the point estimate when using a linear model to fit, the red line indicates the point estimate with a quadratic polynomial model and the grey shading indicates the 95% confidence interval. (B) Example showing the age-profile under which data are simulated (dashed blue line) and the inferred age profile (dashed red line) and 95% credible interval (red shading). (C) Power at P ≤ 0.05 to detect multiple age profiles in a simulation where 90% of variants have a time-invariant profile and 10% have an effect size that increases with age. The solid blue line indicates power when fitting a linear model and the solid red line indicates power when fitting a quadratic model. The dashed red line indicates the nominal significance threshold. Note the change in x-axis scale compared to Fig 2A. (D) Example showing inferred age-profiles for the two components (mean posterior and 95% credible interval). Additional simulation details are provided in the S1 Supplemental Methods and S2 Fig.
Fig 4
Fig 4. Age-varying disease risk profiles.
(A-D) Inferred cluster profiles for four disorders where there is evidence for single non-constant profile; “Primary (essential) hypertension” (ICD-10 code I10; P = 0.0001), “pure hypercholesterolaemia” (E78.0; P = 0.0001), “Calculus of gallbladder without cholecystitis” (K80.2; P = 0.0236) and “Hypothyroidism, unspecified” (E03.9, P = 0.0329); (E-F) Inferred cluster profiles for two disorders where there is evidence for multiple non-constant profiles; “atherosclerotic heart disease of native coronary artery” (I25.1; P = 0.0001) and “other and unspecified malignant neoplasm of skin and unspecified parts of face” (C44.3; P = 0.0092). Curves for all diseases are shown in S4 Fig; Curves for all UK Biobank subjects regardless of ethnic background and for subjects from Black or South Asian ethnic background are shown in S6 Fig. The solid line indicates the posterior mean and the shaded area the 95% credible interval; Numbers in boxes indicate the number of variants in each cluster; All estimates are made with quadratic models for age-varying risk profiles.
Fig 5
Fig 5. The impact of frailty on genetic risk profiles.
(A) Estimated age-profiles for genetic risk for I10 “essential (primary) hypertension” (left) and I25.1 “atherosclerotic heart disease of native coronary artery” (right) fitted under the univariable (purple) and multivariable (green) approaches. For I10, the solid line indicates the posterior mean and the shaded area the 95% credible interval; For I25.1, the solid and dashed lines indicate the means for the two clusters of variants. Comparisons for all diseases are shown in S10 Fig. (B) Estimated incidence by age for K80.2 “Calculus of gallbladder without cholecystitis” (left) and C44.3 “Other and unspecified malignant neoplasm of skin and unspecified parts of face” (right). The red solid line indicates the rate estimated from the UK Biobank (see S1 Supplemental Methods) and the dotted blue line indicates the fitted incidence curve from the parametric model. The P value indicates the Goodness-of-Fit test. Curves for all diseases are shown in S11 Fig. (C) Comparison of inferred genetic effect sizes (red curve) and those implied by the frailty parameters estimated from incidence rate within the UK Biobank (blue dashed curve).
Fig 6
Fig 6. Models for a decreasing influence of genetic risk with age.
(A) A threshold model, in which each individual has a disease “liability” which evolves over age. Disease onset occurs when liability crosses a threshold. The upper panel shows example trajectories, where genetic risk alters only the liability baseline. The middle panel is a schematic representation of a simulation in which genetic risk affects developmental pathways at birth, while non-genetic risk accumulates over time. The lower panel shows an estimation of the effect size from a simulated dataset of UK Biobank sample size (see S1 Supplemental Methods). (B) Interactions between genetic and environmental risk factors can create a distribution of effect sizes for a specific genotype. The upper panel shows example trajectories, where the environment influences the slope of the trajectory. The middle panel shows illustrative examples of the liability distributions among individuals at different ages. Those individuals at highest risk (with both the risk allele and risk environment) enter disease earlier, diluting the apparent effect size at a later age. The lower panel shows simulation results under such a model using realistic parameters from UK Biobank (see S1 Supplemental Methods).

Similar articles

Cited by

References

    1. Rossouw JE. Hormones, genetic factors, and gender differences in cardiovascular disease. Cardiovasc Res. 2002;53: 550–557. doi: 10.1016/s0008-6363(01)00478-3 - DOI - PubMed
    1. Ripatti S, Tikkanen E, Orho-Melander M, Havulinna AS, Silander K, Sharma A, et al.. A multilocus genetic risk score for coronary heart disease: case-control and prospective cohort analyses. Lancet. 2010;376: 1393–1400. doi: 10.1016/S0140-6736(10)61267-6 - DOI - PMC - PubMed
    1. Mavaddat N, Pharoah PDP, Michailidou K, Tyrer J, Brook MN, Bolla MK, et al.. Prediction of breast cancer risk based on profiling with common genetic variants. J Natl Cancer Inst. 2015;107. doi: 10.1093/jnci/djv036 - DOI - PMC - PubMed
    1. Cotsapas C, Voight BF, Rossin E, Lage K, Neale BM, Wallace C, et al.. Pervasive sharing of genetic effects in autoimmune disease. PLoS Genet. 2011;7: e1002254. doi: 10.1371/journal.pgen.1002254 - DOI - PMC - PubMed
    1. Choi SW, Mak TSH, O’Reilly PF. A guide to performing Polygenic Risk Score analyses. 2020. doi: 10.1101/416545 - DOI - PMC - PubMed

Publication types