Identification of prognostic gene biomarkers for metastatic skin cancer using data mining

Biomed Rep. 2020 Jul;13(1):22-30. doi: 10.3892/br.2020.1307. Epub 2020 May 18.

Abstract

Skin cancer is a common malignant tumor in China and throughout the world, and the rate of recurrence is considerably high, thus endangering the quality of life and health of patients, and increasing the economic burden and pressure to the families of those afflicted. Due to the limitations of traditional drug treatments, it is difficult to achieve the desired therapeutic effect of complete removal. However, targeted gene therapy may be a novel means of treating skin cancer, as the targeted nature of treatment may improve therapeutic outcomes. However, targeted gene therapy requires physicians to select the appropriate gene, which means suitable genetic biomarkers must be identified from complex genetic data. In the present study, the least absolute shrinkage and selection operator regression analysis method was used with 10-fold cross verification to reduce the dimensions of gene data in patients with skin cancer, and subsequently, 20 gene biomarkers were screened. A prognostic model was constructed using these 20 gene biomarkers, and the validity of the model was assessed using a training set and a verification set, which showed that the model performed well. Finally, gene function analysis of these 20 gene biomarkers was determined. Relevant studies were found to show that the genetic biomarkers identified in this paper may possess value for the follow-up clinical treatment of skin cancer.

Keywords: data mining; gene biomarker; least absolute shrinkage and selection operator; metastatic; prognosis; skin cancer.