New framework for automated article selection applied to a literature review of Enhanced Biological Phosphorus Removal

PLoS One. 2019 May 9;14(5):e0216126. doi: 10.1371/journal.pone.0216126. eCollection 2019.

Abstract

Aims: Enhanced Biological Phosphorus Removal (EBPR) is a technology widely used in wastewater treatment to remove phosphorus (P) and prevent eutrophication. Establishing its operating efficiency and stability is an active research field that has generated almost 3000 publications in the last 40 years. Due to its size, including over 119 review articles, it is an example of a field where it becomes increasingly difficult to manually recognize its key research contributions, especially for non-experts or newcomers. Therefore, this work included two distinct but complementary objectives. First, to assemble for the first time a collection of bibliometric techniques into a framework for automating the article selection process when preparing a literature review (section 2). Second, to demonstrate it by applying it to the field of EBPR, producing a bibliometric analysis and a review of the key findings of EBPR research over time (section 3).

Findings: The joint analysis of citation networks, keywords, citation profiles, as well as of specific benchmarks for the identification of highly-cited publications revealed 12 research topics. Their content and evolution could be manually reviewed using a selection of articles consisting of approximately only 5% of the original set of publications. The largest topics addressed the identification of relevant microorganisms, the characterization of their metabolism, including denitrification and the competition between them (Clusters A-D). Emerging and influential topics, as determined by different citation indicators and temporal analysis, were related to volatile fatty acid production, P-recovery from waste activated sludge and aerobic granules for better process efficiency and stability (Clusters F-H).

Conclusions: The framework enabled key contributions in each of the constituent topics to be highlighted in a way that may have otherwise been biased by conventional citation-based ranking. Further, it reduced the need for manual input and a priori expertise compared to a traditional literature review. Hence, in an era of accelerated production of information and publications, this work contributed to the way that we are able to use computer-aided approaches to curate information and manage knowledge.

Publication types

  • Research Support, Non-U.S. Gov't

Grants and funding

MNQ acknowledges support from the Engineering and Physical Sciences Research Council (EPSRC, UK) in the form of his PhD studentship. TR acknowledges support from the Royal Society (UK). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.