Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 Jul 1;108(7):1251-1269.
doi: 10.1016/j.ajhg.2021.05.014.

Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects

Affiliations
Comparative Study

Constrained maximum likelihood-based Mendelian randomization robust to both correlated and uncorrelated pleiotropic effects

Haoran Xue et al. Am J Hum Genet. .

Abstract

With the increasing availability of large-scale GWAS summary data on various complex traits and diseases, there have been tremendous interests in applications of Mendelian randomization (MR) to investigate causal relationships between pairs of traits using SNPs as instrumental variables (IVs) based on observational data. In spite of the potential significance of such applications, the validity of their causal conclusions critically depends on some strong modeling assumptions required by MR, which may be violated due to the widespread (horizontal) pleiotropy. Although many MR methods have been proposed recently to relax the assumptions by mainly dealing with uncorrelated pleiotropy, only a few can handle correlated pleiotropy, in which some SNPs/IVs may be associated with hidden confounders, such as some heritable factors shared by both traits. Here we propose a simple and effective approach based on constrained maximum likelihood and model averaging, called cML-MA, applicable to GWAS summary data. To deal with more challenging situations with many invalid IVs with only weak pleiotropic effects, we modify and improve it with data perturbation. Extensive simulations demonstrated that the proposed methods could control the type I error rate better while achieving higher power than other competitors. Applications to 48 risk factor-disease pairs based on large-scale GWAS summary data of 3 cardio-metabolic diseases (coronary artery disease, stroke, and type 2 diabetes), asthma, and 12 risk factors confirmed its superior performance.

Keywords: GWAS; causal inference; data perturbation; goodness-of-fit test; instrumental variable.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Causal model with exposure X and outcome Y (A) Three IV assumptions. (B) A general causal model.
Figure 2
Figure 2
Main simulations: Empirical type I error rates at the nominal level of 0.05 with sample size n = 50,000 and with m = 10 or 100 SNPs, among which 0 to 60% were invalid IVs with the InSIDE assumption either holding or violated
Figure 3
Figure 3
Main simulations: Empirical type I error (for θ=0) and power (for θ0) curves with sample size n = 50,000
Figure 4
Figure 4
Main simulations: Empirical distributions of the estimates of the causal effect θ by the methods with n = 50,000 and θ=0 The numbers below each panel are the mean(θˆ), SD(θˆ), mean squared error (MSE) of θˆ from top to bottom.
Figure 5
Figure 5
Main simulations: Empirical distributions of the estimates of the causal effect θ by the methods with n = 50,000 and θ=0.1 The numbers below each panel are the mean(θˆ), SD(θˆ), mean squared error (MSE) of θˆ from top to bottom.
Figure 6
Figure 6
Secondary simulations: Empirical type I error rates (for θ=0) and power (for θ0) with sample size n = 50,000 or 100,000, and with m = 10 or 100 exposure-associated SNPs
Figure 7
Figure 7
Simulation results with many invalid IVs having weak pleiotropic effects (A) Empirical type I error (for θ=0) and power (for θ0) curves with hy = 0.2 and hu = 0. (B) Relative frequencies of the goodness-of-fit tests rejecting the null hypothesis.
Figure 8
Figure 8
Results of cML-MA-BIC, MR-CAUSE, MR-Mix, and MR-IVW to detect causal relationships among 48 risk factor-disease pairs
Figure 9
Figure 9
The numbers of the significant risk factor-disease pairs detected by various methods at the significance cutoff of p value < 0.001 MR-RAPS refers to RAPS2 (with the Tukey loss and overdispersion).
Figure 10
Figure 10
BIC and scatter plots for four pairs For four risk factor-disease pairs, the left panels show the numbers of invalid IVs versus BIC values, while the right panels show βˆXi versus βˆYi (with their errors bars indicating σˆXi and σˆYi). In the right panels, those for invalid IVs detected by BIC are blue, the red solid lines give the causal estimates (after removing the detected invalid IVs), and the black dashed lines are for the estimates based on all IVs.
Figure 11
Figure 11
Q-Q plots for 53 (likely) null trait-pairs in the secondary real data examples

Similar articles

Cited by

References

    1. Verbanck M., Chen C.Y., Neale B., Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 2018;50:693–698. - PMC - PubMed
    1. Watanabe K., Stringer S., Frei O., Umićević Mirkov M., de Leeuw C., Polderman T.J.C., van der Sluis S., Andreassen O.A., Neale B.M., Posthuma D. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 2019;51:1339–1348. - PubMed
    1. Bowden J., Davey Smith G., Haycock P.C., Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. - PMC - PubMed
    1. Bowden J., Del Greco M F., Minelli C., Davey Smith G., Sheehan N., Thompson J. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 2017;36:1783–1802. - PMC - PubMed
    1. Zhao J., Ming J., Hu X., Chen G., Liu J., Yang C. Bayesian weighted Mendelian randomization for causal inference based on summary statistics. Bioinformatics. 2020;36:1501–1508. - PubMed

Publication types

LinkOut - more resources