A clustering approach for topic filtering within systematic literature reviews

MethodsX. 2020 Feb 22;7:100831. doi: 10.1016/j.mex.2020.100831. eCollection 2020.

Abstract

Within a systematic literature review (SLR), researchers are confronted with vast numbers of articles from scientific databases, which have to be manually evaluated for their relevance to a certain field of observation. The evaluation and filtering phase of prevalent SLR methodologies is therefore time-consuming and difficult to communicate transparently to the intended audience. The proposed method applies natural language processing (NLP) to article metadata and uses a k-means clustering algorithm to automatically convert large article corpora into a distribution of focal topics. This allows efficient filtering and makes the process more objective, because the clustering results can be presented and discussed. Beyond that, it allows scientific communities to be identified quickly and therefore provides an iterative perspective on the so far linear SLR methodology.

• NLP and k-means clustering to filter large article corpora during systematic literature reviews.
• Automated clustering allows filtering that is both more efficient and more effective than manual selection.
• Presentation and discussion of the clustering results helps to objectify the nontransparent filtering step in systematic literature reviews.

Keywords: Clustering; Literature filtering; Systematic literature review.
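
The pipeline summarized in the abstract (text processing of article metadata followed by k-means clustering) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not state how the text is vectorized, so TF-IDF weighting is an assumption here, as are the tiny stopword list, the farthest-point seeding, the toy titles, and every function name (`tokenize`, `tfidf_vectors`, `kmeans`, `top_terms`). Only the Python standard library is used.

```python
import math
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not a real NLP resource.
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}

def tokenize(text):
    """Lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]

def tfidf_vectors(docs):
    """Return (vocabulary, L2-normalized TF-IDF vectors) for a list of strings."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    df = Counter(t for toks in tokenized for t in set(toks))  # document frequency
    n = len(docs)
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        # Smoothed inverse document frequency; one common variant among several.
        vec = [tf[t] * (math.log((1 + n) / (1 + df[t])) + 1) for t in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vocab, vectors

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, max_iter=100):
    """Lloyd's algorithm with deterministic farthest-point seeding (an assumption)."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    labels = None
    for _ in range(max_iter):
        # Assignment step: each document goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:
            break  # converged: no assignment changed
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def top_terms(vocab, centroid, n=3):
    """Highest-weighted vocabulary terms of a centroid, used as a topic label."""
    ranked = sorted(zip(centroid, vocab), reverse=True)
    return [term for weight, term in ranked[:n] if weight > 0]

# Hypothetical article titles standing in for database metadata.
titles = [
    "Lithium ion battery degradation modelling",
    "Battery state of charge estimation for lithium cells",
    "Lithium battery thermal management",
    "Clinical trial outcomes in diabetes patients",
    "Diabetes patients and insulin therapy in a clinical setting",
    "Clinical guidelines for diabetes therapy",
]

vocab, vectors = tfidf_vectors(titles)
labels, centroids = kmeans(vectors, k=2)
for j, centroid in enumerate(centroids):
    print(f"cluster {j}: {labels.count(j)} articles, terms: {top_terms(vocab, centroid)}")
```

In a review setting, the per-cluster term lists would be inspected and whole clusters kept or discarded, which is the filtering step the abstract describes. On a real corpus the number of clusters `k` has to be chosen and justified, and a maintained library would normally replace these hand-written helpers.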