Grouped gene selection and multi-classification of acute leukemia via new regularized multinomial regression

Gene. 2018 Aug 15:667:18-24. doi: 10.1016/j.gene.2018.05.012. Epub 2018 May 9.

Abstract

Diagnosing acute leukemia is the necessary prerequisite to treating it. Multi-classification on the gene expression data of acute leukemia is help for diagnosing it which contains B-cell acute lymphoblastic leukemia (BALL), T-cell acute lymphoblastic leukemia (TALL) and acute myeloid leukemia (AML). However, selecting cancer-causing genes is a challenging problem in performing multi-classification. In this paper, weighted gene co-expression networks are employed to divide the genes into groups. Based on the dividing groups, a new regularized multinomial regression with overlapping group lasso penalty (MROGL) has been presented to simultaneously perform multi-classification and select gene groups. By implementing this method on three-class acute leukemia data, the grouped genes which work synergistically are identified, and the overlapped genes shared by different groups are also highlighted. Moreover, MROGL outperforms other five methods on multi-classification accuracy.

Keywords: Acute leukemia; Grouped gene selection; Multi-classification.

MeSH terms

  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Neoplastic
  • Gene Regulatory Networks*
  • Genetic Predisposition to Disease
  • Humans
  • Leukemia, Myeloid / classification*
  • Leukemia, Myeloid / genetics
  • Leukemia, Myeloid / pathology
  • Leukemia, Myeloid, Acute / genetics
  • Leukemia, Myeloid, Acute / pathology
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / genetics
  • Precursor Cell Lymphoblastic Leukemia-Lymphoma / pathology
  • Precursor T-Cell Lymphoblastic Leukemia-Lymphoma / genetics
  • Precursor T-Cell Lymphoblastic Leukemia-Lymphoma / pathology
  • Regression Analysis