Computational advances of tumor marker selection and sample classification in cancer proteomics
- PMID: 32802273
- PMCID: PMC7403885
- DOI: 10.1016/j.csbj.2020.07.009
Computational advances of tumor marker selection and sample classification in cancer proteomics
Abstract
Cancer proteomics has become a powerful technique for characterizing the protein markers driving transformation of malignancy, tracing proteome variation triggered by therapeutics, and discovering the novel targets and drugs for the treatment of oncologic diseases. To facilitate cancer diagnosis/prognosis and accelerate drug target discovery, a variety of methods for tumor marker identification and sample classification have been developed and successfully applied to cancer proteomic studies. This review article describes the most recent advances in those various approaches together with their current applications in cancer-related studies. Firstly, a number of popular feature selection methods are overviewed with objective evaluation on their advantages and disadvantages. Secondly, these methods are grouped into three major classes based on their underlying algorithms. Finally, a variety of sample separation algorithms are discussed. This review provides a comprehensive overview of the advances on tumor maker identification and patients/samples/tissues separations, which could be guidance to the researches in cancer proteomics.
Keywords: ANN, Artificial Neural Network; ANOVA, Analysis of Variance; CFS, Correlation-based Feature Selection; Cancer proteomics; Computational methods; DAPC, Discriminant Analysis of Principal Component; DT, Decision Trees; EDA, Estimation of Distribution Algorithm; FC, Fold Change; GA, Genetic Algorithms; GR, Gain Ratio; HC, Hill Climbing; HCA, Hierarchical Cluster Analysis; IG, Information Gain; LDA, Linear Discriminant Analysis; LIMMA, Linear Models for Microarray Data; MBF, Markov Blanket Filter; MWW, Mann–Whitney–Wilcoxon test; OPLS-DA, Orthogonal Partial Least Squares Discriminant Analysis; PCA, Principal Component Analysis; PLS-DA, Partial Least Square Discriminant Analysis; RF, Random Forest; RF-RFE, Random Forest with Recursive Feature Elimination; SA, Simulated Annealing; SAM, Significance Analysis of Microarrays; SBE, Sequential Backward Elimination; SFS, and Sequential Forward Selection; SOM, Self-organizing Map; SU, Symmetrical Uncertainty; SVM, Support Vector Machine; SVM-RFE, Support Vector Machine with Recursive Feature Elimination; Sample classification; Tumor marker selection; sPLSDA, Sparse Partial Least Squares Discriminant Analysis; t-SNE, Student t Distribution; χ2, Chi-square.
© 2020 The Authors.
Figures
Similar articles
-
A critical assessment of feature selection methods for biomarker discovery in clinical proteomics.Mol Cell Proteomics. 2013 Jan;12(1):263-76. doi: 10.1074/mcp.M112.022566. Epub 2012 Oct 31. Mol Cell Proteomics. 2013. PMID: 23115301 Free PMC article.
-
An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018. Biomed Res Int. 2018. PMID: 30228989 Free PMC article.
-
Improving PLS-RFE based gene selection for microarray data classification.Comput Biol Med. 2015 Jul;62:14-24. doi: 10.1016/j.compbiomed.2015.04.011. Epub 2015 Apr 17. Comput Biol Med. 2015. PMID: 25912984
-
Application of High Resolution Mass Spectrometric methods coupled with chemometric techniques in olive oil authenticity studies - A review.Anal Chim Acta. 2020 Oct 16;1134:150-173. doi: 10.1016/j.aca.2020.07.029. Epub 2020 Jul 30. Anal Chim Acta. 2020. PMID: 33059861 Review.
-
A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding.Anal Chim Acta. 2015 Jun 16;879:10-23. doi: 10.1016/j.aca.2015.02.012. Epub 2015 Feb 11. Anal Chim Acta. 2015. PMID: 26002472 Review.
Cited by
-
Proteomic and computational analyses followed by functional validation of protective effects of trigonelline against calcium oxalate-induced renal cell deteriorations.Comput Struct Biotechnol J. 2023 Nov 22;21:5851-5867. doi: 10.1016/j.csbj.2023.11.036. eCollection 2023. Comput Struct Biotechnol J. 2023. PMID: 38074474 Free PMC article.
-
Proteomics and metabolomics analyses of Streptococcus agalactiae isolates from human and animal sources.Sci Rep. 2023 Nov 28;13(1):20980. doi: 10.1038/s41598-023-47976-y. Sci Rep. 2023. PMID: 38017083 Free PMC article.
-
Comprehensive analysis of basement membrane and immune checkpoint related lncRNA and its prognostic value in hepatocellular carcinoma via machine learning.Heliyon. 2023 Sep 27;9(10):e20462. doi: 10.1016/j.heliyon.2023.e20462. eCollection 2023 Oct. Heliyon. 2023. PMID: 37810862 Free PMC article.
-
Identifying potential biomarkers of idiopathic pulmonary fibrosis through machine learning analysis.Sci Rep. 2023 Oct 2;13(1):16559. doi: 10.1038/s41598-023-43834-z. Sci Rep. 2023. PMID: 37783761 Free PMC article.
-
Functional Outcomes of Patients with Primary Brain Tumors Undergoing Inpatient Rehabilitation at a Tertiary Care Rehabilitation Facility in Saudi Arabia.Int J Environ Res Public Health. 2023 Mar 7;20(6):4679. doi: 10.3390/ijerph20064679. Int J Environ Res Public Health. 2023. PMID: 36981589 Free PMC article.
References
-
- Malvezzi M., Carioli G., Bertuccio P., Negri E., La Vecchia C. Relation between mortality trends of cardiovascular diseases and selected cancers in the European Union, in 1970–2017. Focus on cohort and period effects. Eur J Cancer. 2018;103:341–355. - PubMed
-
- Arora D., Chaudhary R., Singh A. System biology approach to identify potential receptor for targeting cancer and biomolecular interaction studies of indole[2,1-a]isoquinoline derivative as anticancerous drug candidate against it. Interdiscip Sci Comput Life Sci. 2017;11:125–134. - PubMed
Publication types
LinkOut - more resources
Full Text Sources