Canonical correlation analysis for multilabel classification: a least-squares formulation, extensions, and analysis
- PMID: 20733223
- DOI: 10.1109/TPAMI.2010.160
Abstract
Canonical Correlation Analysis (CCA) is a well-known technique for finding the correlations between two sets of multidimensional variables. It projects both sets of variables onto a lower-dimensional space in which they are maximally correlated. CCA is commonly applied for supervised dimensionality reduction, in which the two sets of variables are derived from the data and the class labels, respectively. It is well known that CCA can be formulated as a least-squares problem in the binary-class case. However, the extension to the more general setting remains unclear. In this paper, we show that, under a mild condition which tends to hold for high-dimensional data, CCA in the multilabel case can be formulated as a least-squares problem. Based on this equivalence relationship, efficient algorithms for solving least-squares problems can be applied to scale CCA to very large data sets. In addition, we propose several CCA extensions, including a sparse CCA formulation based on 1-norm regularization. We further extend the least-squares formulation to partial least squares. We also show that the CCA projection for one set of variables is independent of the regularization on the other set of multidimensional variables, providing new insight into the effect of regularization on CCA. Experiments on benchmark multilabel data sets confirm the established equivalence relationships. The results also demonstrate the effectiveness and efficiency of the proposed CCA extensions.
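The least-squares equivalence claimed in the abstract can be checked numerically on synthetic data. The sketch below is an illustration under stated assumptions, not the paper's code: the abstract does not spell out the "mild condition", so the example assumes that the centered data matrix has rank n - 1 (typical when there are more features than samples), and it assumes the least-squares target T = Yc (Ycᵀ Yc)^(-1/2) built from the centered label matrix. All variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 32, 100, 4  # samples, features, labels (d > n: high-dimensional)

# Synthetic data; rows are samples.
X = rng.standard_normal((n, d))
# Deterministic multilabel matrix: label j of sample i is bit j of i.
Y = ((np.arange(n)[:, None] >> np.arange(k)) & 1).astype(float)

# Center both views.
Xc = X - X.mean(axis=0)
Yc = Y - Y.mean(axis=0)

# Assumed least-squares target: T = Yc (Yc^T Yc)^{-1/2}, orthonormal columns.
evals, evecs = np.linalg.eigh(Yc.T @ Yc)
T = Yc @ (evecs / np.sqrt(evals)) @ evecs.T

# CCA projection for X via whitening: Xc = U S V^T, W = V S^{-1} A, where
# A holds the top-k eigenvectors of U^T P_Y U and P_Y = T T^T.
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
r = int(np.sum(s > 1e-10 * s[0]))  # numerical rank of centered X
U, s, Vt = U[:, :r], s[:r], Vt[:r]
M = U.T @ T @ T.T @ U
lam, A = np.linalg.eigh(M)          # eigenvalues in ascending order
lam, A = lam[-k:], A[:, -k:]        # squared canonical correlations
W_cca = Vt.T @ (A / s[:, None])

# Minimum-norm least-squares solution of Xc W = T.
W_ls, *_ = np.linalg.lstsq(Xc, T, rcond=1e-10)

# The two solutions should agree up to an orthogonal transform, which
# leaves W W^T unchanged.
same_subspace = np.allclose(W_cca @ W_cca.T, W_ls @ W_ls.T)
print("rank of centered X:", r)
print("projections agree up to rotation:", same_subspace)
```

In this setting the two projection matrices differ only by an orthogonal transformation, so pairwise distances in the projected space are identical. That is why the comparison is made on W Wᵀ rather than on W itself.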
Similar articles
- A regularized kernel CCA contrast function for ICA. Neural Netw. 2008 Mar-Apr;21(2-3):170-81. doi: 10.1016/j.neunet.2007.12.047. Epub 2008 Jan 10. PMID: 18280110
- Canonical dependency analysis based on squared-loss mutual information. Neural Netw. 2012 Oct;34:46-55. doi: 10.1016/j.neunet.2012.06.009. Epub 2012 Jul 11. PMID: 22831849
- A learning algorithm for adaptive canonical correlation analysis of several data sets. Neural Netw. 2007 Jan;20(1):139-52. doi: 10.1016/j.neunet.2006.09.011. Epub 2006 Nov 17. PMID: 17113263
- Extensions of sparse canonical correlation analysis with applications to genomic data. Stat Appl Genet Mol Biol. 2009;8(1):Article28. doi: 10.2202/1544-6115.1470. Epub 2009 Jun 9. PMID: 19572827 Free PMC article. Review.
- Canonical Correlation Analysis and Partial Least Squares for Identifying Brain-Behavior Associations: A Tutorial and a Comparative Study. Biol Psychiatry Cogn Neurosci Neuroimaging. 2022 Nov;7(11):1055-1067. doi: 10.1016/j.bpsc.2022.07.012. Epub 2022 Aug 8. PMID: 35952973 Review.
Cited by
- Multi-information improves the performance of CCA-based SSVEP classification. Cogn Neurodyn. 2024 Feb;18(1):165-172. doi: 10.1007/s11571-022-09923-x. Epub 2023 Jan 9. PMID: 38406193
- Preference matrix guided sparse canonical correlation analysis for mining brain imaging genetic associations in Alzheimer's disease. Methods. 2023 Oct;218:27-38. doi: 10.1016/j.ymeth.2023.07.007. Epub 2023 Jul 27. PMID: 37507059
- Preference Matrix Guided Sparse Canonical Correlation Analysis for Genetic Study of Quantitative Traits in Alzheimer's Disease. Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:541-548. doi: 10.1109/bibm55620.2022.9995342. PMID: 36845995 Free PMC article.
- Improved Principal Component Analysis (IPCA): A Novel Method for Quantitative Calibration Transfer between Different Near-Infrared Spectrometers. Molecules. 2023 Jan 3;28(1):406. doi: 10.3390/molecules28010406. PMID: 36615595 Free PMC article.
- Multi-user motion recognition using sEMG via discriminative canonical correlation analysis and adaptive dimensionality reduction. Front Neurorobot. 2022 Oct 28;16:997134. doi: 10.3389/fnbot.2022.997134. eCollection 2022. PMID: 36386392 Free PMC article.
