Clinicians struggle to accurately classify biliary strictures as benign or malignant. Current endoscopic retrograde cholangiopancreatography (ERCP)-based sampling modalities including brush cytology and forceps biopsy have poor sensitivity for pathologic confirmation of malignancy. Cholangioscopy allows for direct visualization and sampling of biliary pathology; however, this technology is also associated with inaccurate classification of biliary disease. Previously, an artificial intelligence (AI) system that analyzes cholangioscopy footage was found to be more accurate in diagnosing biliary malignancy than ERCP sampling techniques. The aim of this study was to validate this AI system on a new series of examinations.Three academic centers collected all available unedited cholangioscopy recordings. The videos were processed by the cholangioscopy AI system. After analyzing videos, the AI system provided predictions as to whether malignancy was present. AI performance in classifying strictures was compared with the performance of brush cytology and forceps biopsy.112 cholangioscopy examinations (containing 4 817 081 images) were generated from 99 patients. Of those examinations, 61 (54.5%) were for investigation of biliary strictures (31 [50.8%] benign, 30 [49.2%] malignant). For the correct classification of strictures, the AI system was 80.0% sensitive and 90.3% specific. It was also significantly more accurate for stricture classification (85.2%) than brush cytology (52.5%; P<0.001), forceps biopsy (68.2%; P=0.04), and the combination of brush cytology and forceps biopsy (66.7%; P=0.02).A previously developed cholangioscopy AI system was found to continually outperform standard ERCP sampling modalities for accurate identification of malignancy, without additional retraining, in a multicenter validation cohort.
Thieme. All rights reserved.