A Novel Approach for Identifying Relevant Genes for Breast Cancer Survivability on Specific Therapies

Evol Bioinform Online. 2018 Aug 10:14:1176934318790266. doi: 10.1177/1176934318790266. eCollection 2018.

Abstract

Analyzing the genetic activity of breast cancer survival for a specific type of therapy provides a better understanding of the body response to the treatment and helps select the best course of action and while leading to the design of drugs based on gene activity. In this work, we use supervised and nonsupervised machine learning methods to deal with a multiclass classification problem in which we label the samples based on the combination of the 5-year survivability and treatment; we focus on hormone therapy, radiotherapy, and surgery. The proposed nonsupervised hierarchical models are created to find the highest separability between combinations of the classes. The supervised model consists of a combination of feature selection techniques and efficient classifiers used to find a potential set of biomarker genes specific to response to therapy. The results show that different models achieve different performance scores with accuracies ranging from 80.9% to 100%. We have investigated the roles of many biomarkers through the literature and found that some of the discriminative genes in the computational model such as ZC3H11A, VAX2, MAF1, and ZFP91 are related to breast cancer and other types of cancer.

Keywords: breast cancer; classification; feature selection; gene biomarkers; machine learning; survival; treatment therapy.

Publication types

  • Review