A novel approach for dimension reduction of microarray

Comput Biol Chem. 2017 Dec:71:161-169. doi: 10.1016/j.compbiolchem.2017.10.009. Epub 2017 Oct 28.

Abstract

This paper proposes a new hybrid search technique for feature (gene) selection (FS) using Independent component analysis (ICA) and Artificial Bee Colony (ABC) called ICA+ABC, to select informative genes based on a Naïve Bayes (NB) algorithm. An important trait of this technique is the optimization of ICA feature vector using ABC. ICA+ABC is a hybrid search algorithm that combines the benefits of extraction approach, to reduce the size of data and wrapper approach, to optimize the reduced feature vectors. This hybrid search technique is facilitated by evaluating the performance of ICA+ABC on six standard gene expression datasets of classification. Extensive experiments were conducted to compare the performance of ICA+ABC with the results obtained from recently published Minimum Redundancy Maximum Relevance (mRMR) +ABC algorithm for NB classifier. Also to check the performance that how ICA+ABC works as feature selection with NB classifier, compared the combination of ICA with popular filter techniques and with other similar bio inspired algorithm such as Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The result shows that ICA+ABC has a significant ability to generate small subsets of genes from the ICA feature vector, that significantly improve the classification accuracy of NB classifier compared to other previously suggested methods.

Keywords: Artificial bee colony (ABC); Cancer classification; Feature selection (FS); Independent component analysis (ICA); Naïve bayes (NB).

MeSH terms

  • Algorithms*
  • Bayes Theorem*
  • Gene Expression
  • Humans
  • Oligonucleotide Array Sequence Analysis*