CONDITIONAL DISTANCE CORRELATION
- PMID: 26877569
- PMCID: PMC4749041
- DOI: 10.1080/01621459.2014.993081
CONDITIONAL DISTANCE CORRELATION
Abstract
Statistical inference on conditional dependence is essential in many fields including genetic association studies and graphical models. The classic measures focus on linear conditional correlations, and are incapable of characterizing non-linear conditional relationship including non-monotonic relationship. To overcome this limitation, we introduces a nonparametric measure of conditional dependence for multivariate random variables with arbitrary dimensions. Our measure possesses the necessary and intuitive properties as a correlation index. Briefly, it is zero almost surely if and only if two multivariate random variables are conditionally independent given a third random variable. More importantly, the sample version of this measure can be expressed elegantly as the root of a V or U-process with random kernels and has desirable theoretical properties. Based on the sample version, we propose a test for conditional independence, which is proven to be more powerful than some recently developed tests through our numerical simulations. The advantage of our test is even greater when the relationship between the multivariate random variables given the third random variable cannot be expressed in a linear or monotonic function of one random variable versus the other. We also show that the sample measure is consistent and weakly convergent, and the test statistic is asymptotically normal. By applying our test in a real data analysis, we are able to identify two conditionally associated gene expressions, which otherwise cannot be revealed. Thus, our measure of conditional dependence is not only an ideal concept, but also has important practical utility.
Keywords: Conditional distance correlation; Conditional independence test; Local bootstrap; U(V) process with random kernel.
Figures
Similar articles
-
Conditional independence test by generalized Kendall's tau with generalized odds ratio.Stat Methods Med Res. 2018 Nov;27(11):3224-3235. doi: 10.1177/0962280217695345. Epub 2017 Feb 23. Stat Methods Med Res. 2018. PMID: 29298614 Free PMC article.
-
Low-order conditional independence graphs for inferring genetic networks.Stat Appl Genet Mol Biol. 2006;5:Article1. doi: 10.2202/1544-6115.1170. Epub 2006 Jan 4. Stat Appl Genet Mol Biol. 2006. PMID: 16646863
-
Dependence and independence: Structure and inference.Stat Methods Med Res. 2017 Oct;26(5):2114-2132. doi: 10.1177/0962280215594198. Epub 2015 Jul 29. Stat Methods Med Res. 2017. PMID: 26229085
-
Learning dependence from samples.Int J Bioinform Res Appl. 2014;10(1):43-58. doi: 10.1504/IJBRA.2014.058777. Int J Bioinform Res Appl. 2014. PMID: 24449692
-
Ball Covariance: A Generic Measure of Dependence in Banach Space.J Am Stat Assoc. 2020;115(529):307-317. doi: 10.1080/01621459.2018.1543600. Epub 2019 Apr 11. J Am Stat Assoc. 2020. PMID: 33299261 Free PMC article.
Cited by
-
The Chi-Square Test of Distance Correlation.J Comput Graph Stat. 2022;31(1):254-262. doi: 10.1080/10618600.2021.1938585. Epub 2021 Jul 19. J Comput Graph Stat. 2022. PMID: 35707063 Free PMC article.
-
A Projection-based Conditional Dependence Measure with Applications to High-dimensional Undirected Graphical Models.J Econom. 2020 Sep;218(1):119-139. doi: 10.1016/j.jeconom.2019.12.016. Epub 2020 Feb 15. J Econom. 2020. PMID: 33208987 Free PMC article.
-
Phylogenetic association analysis with conditional rank correlation.Biometrika. 2023 Dec 1;111(3):881-902. doi: 10.1093/biomet/asad075. eCollection 2024 Sep. Biometrika. 2023. PMID: 39239268
-
Projection expectile regression for sufficient dimension reduction.Comput Stat Data Anal. 2023 Apr;180:107666. doi: 10.1016/j.csda.2022.107666. Epub 2022 Nov 25. Comput Stat Data Anal. 2023. PMID: 36506351 Free PMC article.
-
Discovering and deciphering relationships across disparate data modalities.Elife. 2019 Jan 15;8:e41690. doi: 10.7554/eLife.41690. Elife. 2019. PMID: 30644820 Free PMC article.
References
-
- Ackley H, Hinton E, Sejnowski J. A learning algorithm for Boltzmann machines. Cognitive Science. 1985:147–169.
-
- Fan Y, Li Q. Consistent model specification tests: omitted variables and semiparametric functional forms. Econometrica: Journal of the Econometric Society. 1996:865–890.
-
- Fukumizu K, Gretton A, Sun X, Schölkopf B. Kernel measures of conditional dependence. Conference on Neural Information Processing Systems 2008
-
- Gretton A, Bousquet O, Smola A, Schölkopf B. Measuring statistical dependence with Hilbert-Schmidt norms. Proceedings Algorithmic Learning Theory. 2005:63–77.
-
- Hageman GS, Anderson DH, Johnson LV, Hancox LS, Taiber AJ, Hardisty LI, Hageman JL, Stockman HA, Borchardt JD, Gehrs KM, et al. A common haplotype in the complement regulatory gene factor H (HF1/CFH) predisposes individuals to age-related macular degeneration. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(20):7227–7232. - PMC - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources