Detection of trachoma using machine learning approaches

Damien Socia; Christopher J Brady; Sheila K West; R Chase Cockrell

doi:10.1371/journal.pntd.0010943

Detection of trachoma using machine learning approaches

PLoS Negl Trop Dis. 2022 Dec 7;16(12):e0010943. doi: 10.1371/journal.pntd.0010943. eCollection 2022 Dec.

Authors

Damien Socia¹, Christopher J Brady^{1

2}, Sheila K West³, R Chase Cockrell¹

Affiliations

¹ Division of Surgical Research, Department of Surgery, Larner College of Medicine, University of Vermont, Burlington, Vermont, United States of America.
² Division of Ophthalmology, Department of Surgery, Larner College of Medicine, University of Vermont, Burlington, Vermont, United States of America.
³ Dana Center for Preventive Ophthalmology, Wilmer Eye Institute, Baltimore, Maryland, United States of America.

Abstract

Background: Though significant progress in disease elimination has been made over the past decades, trachoma is the leading infectious cause of blindness globally. Further efforts in trachoma elimination are paradoxically being limited by the relative rarity of the disease, which makes clinical training for monitoring surveys difficult. In this work, we evaluate the plausibility of an Artificial Intelligence model to augment or replace human image graders in the evaluation/diagnosis of trachomatous inflammation-follicular (TF).

Methods: We utilized a dataset consisting of 2300 images with a 5% positivity rate for TF. We developed classifiers by implementing two state-of-the-art Convolutional Neural Network architectures, ResNet101 and VGG16, and applying a suite of data augmentation/oversampling techniques to the positive images. We then augmented our data set with additional images from independent research groups and evaluated performance.

Results: Models performed well in minimizing the number of false negatives, given the constraint of the low numbers of images in which TF was present. The best performing models achieved a sensitivity of 95% and positive predictive value of 50-70% while reducing the number images requiring skilled grading by 66-75%. Basic oversampling and data augmentation techniques were most successful at improving model performance, while techniques that are grounded in clinical experience, such as highlighting follicles, were less successful.

Discussion: The developed models perform well and significantly reduce the burden on graders by minimizing the number of false negative identifications. Further improvements in model skill will benefit from data sets with more TF as well as a range in image quality and image capture techniques used. While these models approach/meet the community-accepted standard for skilled field graders (i.e., Cohen's Kappa >0.7), they are insufficient to be deployed independently/clinically at this time; rather, they can be utilized to significantly reduce the burden on skilled image graders.

Copyright: © 2022 Socia et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Artificial Intelligence
Humans
Machine Learning
Neural Networks, Computer
Predictive Value of Tests
Trachoma* / diagnosis

Associated data

figshare/10.6084/m9.figshare.7551053

Grants and funding

P20 GM125498/GM/NIGMS NIH HHS/United States