Statistical properties of multivariate distance matrix regression for high-dimensional data analysis
- PMID: 23060897
- PMCID: PMC3461701
- DOI: 10.3389/fgene.2012.00190
Statistical properties of multivariate distance matrix regression for high-dimensional data analysis
Abstract
Multivariate distance matrix regression (MDMR) analysis is a statistical technique that allows researchers to relate P variables to an additional M factors collected on N individuals, where P ≫ N. The technique can be applied to a number of research settings involving high-dimensional data types such as DNA sequence data, gene expression microarray data, and imaging data. MDMR analysis involves computing the distance between all pairs of individuals with respect to P variables of interest and constructing an N × N matrix whose elements reflect these distances. Permutation tests can be used to test linear hypotheses that consider whether or not the M additional factors collected on the individuals can explain variation in the observed distances between and among the N individuals as reflected in the matrix. Despite its appeal and utility, properties of the statistics used in MDMR analysis have not been explored in detail. In this paper we consider the level accuracy and power of MDMR analysis assuming different distance measures and analysis settings. We also describe the utility of MDMR analysis in assessing hypotheses about the appropriate number of clusters arising from a cluster analysis.
Keywords: distance matrix; multivariate analysis; regression analysis; simulation.
Figures
Similar articles
-
Extending multivariate distance matrix regression with an effect size measure and the asymptotic null distribution of the test statistic.Psychometrika. 2017 Dec;82(4):1052-1077. doi: 10.1007/s11336-016-9527-8. Epub 2016 Oct 13. Psychometrika. 2017. PMID: 27738957 Free PMC article.
-
Distance-based phenotypic association analysis of DNA sequence data.BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S54. doi: 10.1186/1753-6561-5-S9-S54. BMC Proc. 2011. PMID: 22373107 Free PMC article.
-
A regression framework for brain network distance metrics.Netw Neurosci. 2022 Feb 1;6(1):49-68. doi: 10.1162/netn_a_00214. eCollection 2022 Feb. Netw Neurosci. 2022. PMID: 35350586 Free PMC article.
-
Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables.Proc Natl Acad Sci U S A. 2006 Dec 19;103(51):19430-5. doi: 10.1073/pnas.0609333103. Epub 2006 Dec 4. Proc Natl Acad Sci U S A. 2006. PMID: 17146048 Free PMC article.
-
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification.In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 25. PMID: 26269925 Free Books & Documents. Review.
Cited by
-
A Connectome-wide Functional Signature of Transdiagnostic Risk for Mental Illness.Biol Psychiatry. 2018 Sep 15;84(6):452-459. doi: 10.1016/j.biopsych.2018.03.012. Epub 2018 Apr 10. Biol Psychiatry. 2018. PMID: 29779670 Free PMC article.
-
Altered Development of Amygdala-Connected Brain Regions in Males and Females with Autism.J Neurosci. 2022 Aug 3;42(31):6145-6155. doi: 10.1523/JNEUROSCI.0053-22.2022. Epub 2022 Jun 27. J Neurosci. 2022. PMID: 35760533 Free PMC article.
-
A multivariate distance-based analytic framework for connectome-wide association studies.Neuroimage. 2014 Jun;93 Pt 1(0 1):74-94. doi: 10.1016/j.neuroimage.2014.02.024. Epub 2014 Feb 28. Neuroimage. 2014. PMID: 24583255 Free PMC article.
-
The computational and neural substrates of moral strategies in social decision-making.Nat Commun. 2019 Apr 2;10(1):1483. doi: 10.1038/s41467-019-09161-6. Nat Commun. 2019. PMID: 30940815 Free PMC article.
-
Sex Differences in the Amygdala Resting-State Connectome of Children With Autism Spectrum Disorder.Biol Psychiatry Cogn Neurosci Neuroimaging. 2020 Mar;5(3):320-329. doi: 10.1016/j.bpsc.2019.08.004. Epub 2019 Aug 21. Biol Psychiatry Cogn Neurosci Neuroimaging. 2020. PMID: 31563470 Free PMC article.
References
-
- Anderson M. J. (2001). A new method for non-parametric multivariate analysis of variance. Austral Ecol. 26, 32–4610.1111/j.1442-9993.2001.01070.pp.x - DOI
-
- Donoho D. L. (2000). High-dimensional data analysis: the curses and blessings of dimensionality. Aide-Memoire of the Lecture in American Mathematical Society Conference: Math Challenges of 21st Century Available at: http://www.stat.stanford.edu/~donoho/Lectures/AMS2000/AMS2000.html
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
