Relief-based feature selection: Introduction and review
- PMID: 30031057
- PMCID: PMC6299836
- DOI: 10.1016/j.jbi.2018.07.014
Relief-based feature selection: Introduction and review
Abstract
Feature selection plays a critical role in biomedical data mining, driven by increasing feature dimensionality in target problems and growing interest in advanced but computationally expensive methodologies able to model complex associations. Specifically, there is a need for feature selection methods that are computationally efficient, yet sensitive to complex patterns of association, e.g. interactions, so that informative features are not mistakenly eliminated prior to downstream modeling. This paper focuses on Relief-based algorithms (RBAs), a unique family of filter-style feature selection algorithms that have gained appeal by striking an effective balance between these objectives while flexibly adapting to various data characteristics, e.g. classification vs. regression. First, this work broadly examines types of feature selection and defines RBAs within that context. Next, we introduce the original Relief algorithm and associated concepts, emphasizing the intuition behind how it works, how feature weights generated by the algorithm can be interpreted, and why it is sensitive to feature interactions without evaluating combinations of features. Lastly, we include an expansive review of RBA methodological research beyond Relief and its popular descendant, ReliefF. In particular, we characterize branches of RBA research, and provide comparative summaries of RBA algorithms including contributions, strategies, functionality, time complexity, adaptation to key data characteristics, and software availability.
Keywords: Epistasis; Feature interaction; Feature selection; Feature weighting; Filter; ReliefF.
Copyright © 2018 Elsevier Inc. All rights reserved.
Figures
Similar articles
-
Benchmarking relief-based feature selection methods for bioinformatics data mining.J Biomed Inform. 2018 Sep;85:168-188. doi: 10.1016/j.jbi.2018.07.015. Epub 2018 Jul 17. J Biomed Inform. 2018. PMID: 30030120 Free PMC article.
-
A Hybrid Feature Selection Method Based on Binary State Transition Algorithm and ReliefF.IEEE J Biomed Health Inform. 2019 Sep;23(5):1888-1898. doi: 10.1109/JBHI.2018.2872811. Epub 2018 Sep 28. IEEE J Biomed Health Inform. 2019. PMID: 30281502
-
Feature selection and nearest centroid classification for protein mass spectrometry.BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68. BMC Bioinformatics. 2005. PMID: 15788095 Free PMC article.
-
Feature selection methods for big data bioinformatics: A survey from the search perspective.Methods. 2016 Dec 1;111:21-31. doi: 10.1016/j.ymeth.2016.08.014. Epub 2016 Aug 31. Methods. 2016. PMID: 27592382 Review.
-
A review of feature selection methods in medical applications.Comput Biol Med. 2019 Sep;112:103375. doi: 10.1016/j.compbiomed.2019.103375. Epub 2019 Jul 31. Comput Biol Med. 2019. PMID: 31382212 Review.
Cited by
-
Magnetic resonance imaging based on radiomics for differentiating T1-category nasopharyngeal carcinoma from nasopharyngeal lymphoid hyperplasia: a multicenter study.Jpn J Radiol. 2024 Feb 27. doi: 10.1007/s11604-024-01544-0. Online ahead of print. Jpn J Radiol. 2024. PMID: 38409300
-
KNCFS: Feature selection for high-dimensional datasets based on improved random multi-subspace learning.PLoS One. 2024 Feb 23;19(2):e0296108. doi: 10.1371/journal.pone.0296108. eCollection 2024. PLoS One. 2024. PMID: 38394325 Free PMC article.
-
Classification of Multiple H&E Images via an Ensemble Computational Scheme.Entropy (Basel). 2023 Dec 28;26(1):34. doi: 10.3390/e26010034. Entropy (Basel). 2023. PMID: 38248160 Free PMC article.
-
Digital image analysis and machine learning-assisted prediction of neoadjuvant chemotherapy response in triple-negative breast cancer.Breast Cancer Res. 2024 Jan 18;26(1):12. doi: 10.1186/s13058-023-01752-y. Breast Cancer Res. 2024. PMID: 38238771 Free PMC article.
-
Classification of Game Demand and the Presence of Experimental Pain Using Functional Near-Infrared Spectroscopy.Front Neuroergon. 2021 Dec 21;2:695309. doi: 10.3389/fnrgo.2021.695309. eCollection 2021. Front Neuroergon. 2021. PMID: 38235227 Free PMC article.
References
-
- Agre G, Dzhondzhorov A, 2016. A weighted feature selection method for instance-based classification. In: International Conference on Artificial Intelligence: Methodology, Systems, and Applications Springer, pp. 14–25.
-
- Aha DW, Kibler D, Albert MK, 1991. Instance-based learning algorithms. Machine learning 6 (1), 37–66.
-
- Almuallim H, Dietterich TG, 1991. Learning with many irrelevant features. In: AAAI. Vol. 91 pp. 547–552.
-
- Arauzo-Azofra A, Benitez JM, Castro JL, 2004. A feature set measure based on relief. In: Proceedings of the fifth international conference on Recent Advances in Soft Computing pp. 104–109.
-
- Belanche LA, Gonz´alez FF, 2011. Review and evaluation of feature selection algorithms in synthetic problems. arXiv preprint arXiv:1101.2320.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
