Fuzzy Gaussian Lasso clustering with application to cancer data

Math Biosci Eng. 2019 Sep 30;17(1):250-265. doi: 10.3934/mbe.2020014.

Abstract

Recently, Yang et al. (2019) proposed a fuzzy model-based Gaussian (F-MB-Gauss) clustering that combines a model-based Gaussian with fuzzy membership functions for clustering. In this paper, we further consider the F-MB-Gauss clustering with the least absolute shrinkage and selection operator (Lasso) for feature (variable) selection, termed a fuzzy Gaussian Lasso (FG-Lasso) clustering algorithm. We demonstrate that the proposed FG-Lasso is a good clustering algorithm with better choice for feature subset selection. Experimental results and comparisons actually present these good aspects of the proposed FG-Lasso clustering algorithm. Cancer is a disease with growth of abnormal cells in a body. WHO reported that it is the first or second main leading cause of death. It spreads and affects the other parts of body if there is not properly diagnosed. In the paper, we apply the proposed FG-Lasso to cancer data with good feature selection and clustering results.

Keywords: Fuzzy Gaussian Lasso (FG-Lasso) clustering; Lasso; feature selection; fuzzy model-based Gaussian; fuzzy sets; model-based clustering.

MeSH terms

  • Algorithms
  • Blood Glucose / analysis
  • Breast Neoplasms / diagnosis
  • Cluster Analysis*
  • Colonic Neoplasms / diagnosis
  • Female
  • Fuzzy Logic*
  • Glucose Tolerance Test
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Leukemia / diagnosis
  • Male
  • Models, Statistical
  • Neoplasms / diagnosis*
  • Normal Distribution
  • Pattern Recognition, Automated
  • Signal Processing, Computer-Assisted

Substances

  • Blood Glucose