Machine learning for screening prioritization in systematic reviews: comparative performance of Abstrackr and EPPI-Reviewer
- PMID: 32241297
- PMCID: PMC7118839
- DOI: 10.1186/s13643-020-01324-7
Machine learning for screening prioritization in systematic reviews: comparative performance of Abstrackr and EPPI-Reviewer
Abstract
Background: Improving the speed of systematic review (SR) development is key to supporting evidence-based medicine. Machine learning tools which semi-automate citation screening might improve efficiency. Few studies have assessed use of screening prioritization functionality or compared two tools head to head. In this project, we compared performance of two machine-learning tools for potential use in citation screening.
Methods: Using 9 evidence reports previously completed by the ECRI Institute Evidence-based Practice Center team, we compared performance of Abstrackr and EPPI-Reviewer, two off-the-shelf citations screening tools, for identifying relevant citations. Screening prioritization functionality was tested for 3 large reports and 6 small reports on a range of clinical topics. Large report topics were imaging for pancreatic cancer, indoor allergen reduction, and inguinal hernia repair. We trained Abstrackr and EPPI-Reviewer and screened all citations in 10% increments. In Task 1, we inputted whether an abstract was ordered for full-text screening; in Task 2, we inputted whether an abstract was included in the final report. For both tasks, screening continued until all studies ordered and included for the actual reports were identified. We assessed potential reductions in hypothetical screening burden (proportion of citations screened to identify all included studies) offered by each tool for all 9 reports.
Results: For the 3 large reports, both EPPI-Reviewer and Abstrackr performed well with potential reductions in screening burden of 4 to 49% (Abstrackr) and 9 to 60% (EPPI-Reviewer). Both tools had markedly poorer performance for 1 large report (inguinal hernia), possibly due to its heterogeneous key questions. Based on McNemar's test for paired proportions in the 3 large reports, EPPI-Reviewer outperformed Abstrackr for identifying articles ordered for full-text review, but Abstrackr performed better in 2 of 3 reports for identifying articles included in the final report. For small reports, both tools provided benefits but EPPI-Reviewer generally outperformed Abstrackr in both tasks, although these results were often not statistically significant.
Conclusions: Abstrackr and EPPI-Reviewer performed well, but prioritization accuracy varied greatly across reports. Our work suggests screening prioritization functionality is a promising modality offering efficiency gains without giving up human involvement in the screening process.
Keywords: Abstrackr; Citation screening; EPPI-Reviewer; Efficiency; Machine learning; Methodology; Screening burden; Screening prioritization; Text-mining.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
Similar articles
-
Technology-assisted title and abstract screening for systematic reviews: a retrospective evaluation of the Abstrackr machine learning tool.Syst Rev. 2018 Mar 12;7(1):45. doi: 10.1186/s13643-018-0707-8. Syst Rev. 2018. PMID: 29530097 Free PMC article.
-
Evaluation of text mining to reduce screening workload for injury-focused systematic reviews.Inj Prev. 2020 Feb;26(1):55-60. doi: 10.1136/injuryprev-2019-043247. Epub 2019 Aug 26. Inj Prev. 2020. PMID: 31451565
-
A text-mining tool generated title-abstract screening workload savings: performance evaluation versus single-human screening.J Clin Epidemiol. 2022 Sep;149:53-59. doi: 10.1016/j.jclinepi.2022.05.017. Epub 2022 May 30. J Clin Epidemiol. 2022. PMID: 35654270
-
Evaluating the efficacy of artificial intelligence tools for the automation of systematic reviews in cancer research: A systematic review.Cancer Epidemiol. 2024 Feb;88:102511. doi: 10.1016/j.canep.2023.102511. Epub 2023 Dec 9. Cancer Epidemiol. 2024. PMID: 38071872 Review.
-
Assessing the Accuracy of Machine-Assisted Abstract Screening With DistillerAI: A User Study [Internet].Rockville (MD): Agency for Healthcare Research and Quality (US); 2019 Nov. Report No.: 19(20)-EHC026-EF. Rockville (MD): Agency for Healthcare Research and Quality (US); 2019 Nov. Report No.: 19(20)-EHC026-EF. PMID: 31804782 Free Books & Documents. Review.
Cited by
-
Boosting efficiency in a clinical literature surveillance system with LightGBM.PLOS Digit Health. 2024 Sep 23;3(9):e0000299. doi: 10.1371/journal.pdig.0000299. eCollection 2024 Sep. PLOS Digit Health. 2024. PMID: 39312500 Free PMC article.
-
An exploration of available methods and tools to improve the efficiency of systematic review production: a scoping review.BMC Med Res Methodol. 2024 Sep 18;24(1):210. doi: 10.1186/s12874-024-02320-4. BMC Med Res Methodol. 2024. PMID: 39294580 Free PMC article. Review.
-
Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed.Syst Rev. 2024 Jul 8;13(1):174. doi: 10.1186/s13643-024-02592-3. Syst Rev. 2024. PMID: 38978132 Free PMC article.
-
Mapping the scientific knowledge and approaches to defining and measuring hate crime, hate speech, and hate incidents: A systematic review.Campbell Syst Rev. 2024 Apr 28;20(2):e1397. doi: 10.1002/cl2.1397. eCollection 2024 Jun. Campbell Syst Rev. 2024. PMID: 38686101 Free PMC article. Review.
-
The SAFE procedure: a practical stopping heuristic for active learning-based screening in systematic reviews and meta-analyses.Syst Rev. 2024 Mar 1;13(1):81. doi: 10.1186/s13643-024-02502-7. Syst Rev. 2024. PMID: 38429798 Free PMC article.
References
-
- Sackett, David R. Evidence-based medicine: how to practice and teach EBM, 2nd Edition: By David L. Sackett, Sharon E. Straus, W. Scott Richardson, William Rosenberg, and R. Brian Haynes, Churchill Livingstone, 2000. Vol. 16. 2001 [cited 2019 Jul 18]. Available from: 10.1177/088506660101600307.
-
- Committee on Standards for Systematic Reviews, Institute of Medicine . Finding what works in health care: standards for systematic reviews. 2011. - PubMed
-
- Institute of Medicine (US) Committee on Standards for Developing Trustworthy Clinical Practice Guidelines . In: Clinical practice guidelines we can trust. Graham R, Mancher M, Miller Wolman D, Greenfield S, Steinberg E, editors. Washington: National Academies Press (US); 2011. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials
