A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
- PMID: 19377034
- PMCID: PMC2697346
- DOI: 10.1093/biostatistics/kxp008
A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
Abstract
We present a penalized matrix decomposition (PMD), a new framework for computing a rank-K approximation for a matrix. We approximate the matrix X as circumflexX = sigma(k=1)(K) d(k)u(k)v(k)(T), where d(k), u(k), and v(k) minimize the squared Frobenius norm of X - circumflexX, subject to penalties on u(k) and v(k). This results in a regularized version of the singular value decomposition. Of particular interest is the use of L(1)-penalties on u(k) and v(k), which yields a decomposition of X using sparse vectors. We show that when the PMD is applied using an L(1)-penalty on v(k) but not on u(k), a method for sparse principal components results. In fact, this yields an efficient algorithm for the "SCoTLASS" proposal (Jolliffe and others 2003) for obtaining sparse principal components. This method is demonstrated on a publicly available gene expression data set. We also establish connections between the SCoTLASS method for sparse principal component analysis and the method of Zou and others (2006). In addition, we show that when the PMD is applied to a cross-products matrix, it results in a method for penalized canonical correlation analysis (CCA). We apply this penalized CCA method to simulated data and to a genomic data set consisting of gene expression and DNA copy number measurements on the same set of samples.
Figures
. The constraints ‖u‖1 = 1 and ‖u‖1=
are shown using dashed lines. Right: The L2- and L1-constraints on u are shown for some c between 1 and
. Small circles indicate the points where both the L1- and the L2-constraints are active. The solid arcs indicate the solutions that occur when Δ1 = 0 in Algorithm 3. The figure shows that in 2D, the points where both the L1- and L2-constraints are active do not have either u1 or u2 equal to 0.
Similar articles
-
Sparse canonical correlation analysis from a predictive point of view.Biom J. 2015 Sep;57(5):834-51. doi: 10.1002/bimj.201400226. Epub 2015 Jul 6. Biom J. 2015. PMID: 26147637
-
Extensions of sparse canonical correlation analysis with applications to genomic data.Stat Appl Genet Mol Biol. 2009;8(1):Article28. doi: 10.2202/1544-6115.1470. Epub 2009 Jun 9. Stat Appl Genet Mol Biol. 2009. PMID: 19572827 Free PMC article. Review.
-
An iterative penalized least squares approach to sparse canonical correlation analysis.Biometrics. 2019 Sep;75(3):734-744. doi: 10.1111/biom.13043. Epub 2019 Apr 9. Biometrics. 2019. PMID: 30714093
-
Robust Principal Component Analysis Regularized by Truncated Nuclear Norm for Identifying Differentially Expressed Genes.IEEE Trans Nanobioscience. 2017 Sep;16(6):447-454. doi: 10.1109/TNB.2017.2723439. Epub 2017 Jul 4. IEEE Trans Nanobioscience. 2017. PMID: 28692983
-
Clutter suppression in ultrasound: performance evaluation and review of low-rank and sparse matrix decomposition methods.Biomed Eng Online. 2020 May 28;19(1):37. doi: 10.1186/s12938-020-00778-z. Biomed Eng Online. 2020. PMID: 32466753 Free PMC article. Review.
Cited by
-
Social optimism biases are associated with cortical thickness.Soc Cogn Affect Neurosci. 2020 Sep 24;15(7):745-754. doi: 10.1093/scan/nsaa095. Soc Cogn Affect Neurosci. 2020. PMID: 32685967 Free PMC article.
-
Biclustering with heterogeneous variance.Proc Natl Acad Sci U S A. 2013 Jul 23;110(30):12253-8. doi: 10.1073/pnas.1304376110. Epub 2013 Jul 8. Proc Natl Acad Sci U S A. 2013. PMID: 23836637 Free PMC article.
-
WAVELET-DOMAIN REGRESSION AND PREDICTIVE INFERENCE IN PSYCHIATRIC NEUROIMAGING.Ann Appl Stat. 2015 Jun;9(2):1076-1101. doi: 10.1214/15-AOAS829. Epub 2015 Jul 20. Ann Appl Stat. 2015. PMID: 27330652 Free PMC article.
-
Psychiatric Neural Networks and Precision Therapeutics by Machine Learning.Biomedicines. 2021 Apr 8;9(4):403. doi: 10.3390/biomedicines9040403. Biomedicines. 2021. PMID: 33917863 Free PMC article. Review.
-
Bootstrap Evaluation of Association Matrices (BEAM) for Integrating Multiple Omics Profiles with Multiple Outcomes.bioRxiv [Preprint]. 2024 Aug 3:2024.07.31.605805. doi: 10.1101/2024.07.31.605805. bioRxiv. 2024. PMID: 39131398 Free PMC article. Preprint.
References
-
- Boyd S, Vandenberghe L. Convex Optimization. New York: Cambridge University Press; 2004.
-
- Chin K, DeVries S, Fridlyand J, Spellman P, Roydasgupta R, Kuo W-L, Lapuk A, Neve R, Qian Z, Ryder T. Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. Cancer Cell. 2006;10:529–541. and others. - PubMed
-
- Dudoit S, Fridlyand J, Speed T. Comparison of discrimination methods for the classification of tumors using gene expression data. Journal of the American Statistical Association. 2001;96:1151–1160.
-
- Eckart C, Young G. The approximation of one matrix by another of low rank. Psychometrika. 1936;1:211.
-
- Friedman J, Hastie T, Hoefling H, Tibshirani R. Pathwise coordinate optimization. Annals of Applied Statistics. 2007;1:302–332.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
