Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec;76(4):1262-1272.
doi: 10.1111/biom.13214. Epub 2020 Jan 13.

Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies

Affiliations

Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies

Zachary R McCaw et al. Biometrics. 2020 Dec.

Abstract

Quantitative traits analyzed in Genome-Wide Association Studies (GWAS) are often nonnormally distributed. For such traits, association tests based on standard linear regression are subject to reduced power and inflated type I error in finite samples. Applying the rank-based inverse normal transformation (INT) to nonnormally distributed traits has become common practice in GWAS. However, the different variations on INT-based association testing have not been formally defined, and guidance is lacking on when to use which approach. In this paper, we formally define and systematically compare the direct (D-INT) and indirect (I-INT) INT-based association tests. We discuss their assumptions, underlying generative models, and connections. We demonstrate that the relative powers of D-INT and I-INT depend on the underlying data generating process. Since neither approach is uniformly most powerful, we combine them into an adaptive omnibus test (O-INT). O-INT is robust to model misspecification, protects the type I error, and is well powered against a wide range of nonnormally distributed traits. Extensive simulations were conducted to examine the finite sample operating characteristics of these tests. Our results demonstrate that, for nonnormally distributed traits, INT-based tests outperform the standard untransformed association test, both in terms of power and type I error rate control. We apply the proposed methods to GWAS of spirometry traits in the UK Biobank. O-INT has been implemented in the R package RNOmni, which is available on CRAN.

Keywords: direct and indirect rank-based inverse normal transformation; nonnormality; omnibus test; quantitative traits; transformation; type I error rate.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Distribution of association p-values under the null at sample size n =103 across R =108 simulation replicates. Rows correspond to different phenotype distributions. The first phenotype has normal residuals; the second has χ12 residuals; the third phenotype has t3 residuals; and the log of the fourth phenotype has normal residuals. Columns correspond to different association tests. The first is the untransformed association test (UAT), the second is the direct INT (D-INT), the third is indirect INT (I-INT), and the fourth column is omnibus INT (O-INT). Note that this figure appears in color in the electronic version of this article, and any mention of color refers to that version
FIGURE 2
FIGURE 2
Power curves at α =10−6 and sample size n =103 across R =106 simulation replicates. Simulations were conducted at heritabilities ranging from 0.1% and 1.0%. Gray is the UAT, blue is D-INT, I-INT, and red is O-INT. Each panel corresponds to a different phenotype. The first phenotype has normal residuals; the second has χ12 residuals; the third phenotype has t3 residuals; and the log of the fourth phenotype has normal residuals. Note that this figure appears in color in the electronic version of this article, and any mention of color refers to that version

Similar articles

Cited by

References

    1. Abbott L, Bryant S, Churchhouse C, Ganna A, Howrigan D, Palmer D, Neale B, Walters R, Carey C for The Hail team. (2017) UK Biobank GWAS results, https://www.nealelab.is/uk-biobank (Accessed 2 January 2019).
    1. Barber MJ, Mangravite LM, Hyde CL, Chasman DI, Smith JD, McCarty CA et al. (2010) Genome-wide association of lipid-lowering response to statins in combined study populations. PLOS One, 5, e9763. - PMC - PubMed
    1. Beasley TM, Erickson S and Allison DB (2009) Rank-based inverse normal transformations are increasingly used, but are they merited? Behavioral Genetics, 39, 580–95. - PMC - PubMed
    1. Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics Consortium, Patterson N, Daly MJ, Price AL and Neale BM (2015) LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics, 47, 291–295. - PMC - PubMed
    1. Cade BE, Chen H, Stilp AM, Gleason KJ, Sofer T et al. (2016) Genetic associations with obstructive sleep apnea traits in Hispanic/Latino Americans. American Journal of Respiratory and Critical Care Medicine, 194, 886–897. - PMC - PubMed

Publication types