Setting: Tuberculosis (TB) prevalence survey in Zambia between 2013 and 2014.
Objective: To compare the performance of automatic software (CAD4TB 5) in chest X-ray (CXR) reading with that of field (general practitioners) and central (radiologists) readers.
Design: A retrospective study comparing the performance of human and automatic reading was conducted. Two scenarios for central reading were evaluated: abnormalities not consistent with TB were considered to be 'normal' or 'abnormal'. Sputum culture was defined as the reference standard. Measures derived from receiver operating characteristic analysis were used to assess readers' performances.
Results: Of 46 099 participants, 23 838 cases included all survey information; of these, 106 cases were culture-confirmed TB-positive. The performance of CAD4TB 5 was similar to that of field and central readers. Although there were significant differences in specificity when compared with field readings (P = 0.002) and central readings considering the first scenario (P < 0.001), these differences were not substantial (93.2% vs. 92.6% and 98.4% vs. 99.6%, respectively).CONCLUSIONp: The performance of automatic CXR readings is comparable with that of human experts in a TB prevalence survey setting using culture as reference.