Objective: The presence of calcification, especially microcalcification, is often associated with an increased risk of malignancy and closely linked to papillary thyroid carcinoma (PTC), the most common type of thyroid cancer. However, existing diagnostic ultrasound (US) imaging has critical limitations such as inability to detect subtle calcifications via standard static imaging, leading to 15-20% delayed PTC treatment or unnecessary fine-needle aspiration. This study aimed to develop a calcification-optimized, interpretable deep learning (DL) model based on dynamic ultrasound videos to determine the malignancy nature of calcified thyroid nodules.
Design and methods: This study retrospectively collected ultrasound dynamic video data from 1,257 patients, containing 2,319 thyroid nodules across six hospitals between January 2020 and October 2023. Various DL models were constructed with optimization specifically implemented on the 3D InceptionResNetV2 network by including a calcification attention module to enhance sensitivity to micro-calcifications. Model performance was compared not only with those trained on 2D static ultrasound images, but also against diagnoses from four clinicians (2 junior and 2 senior radiologists). The dataset was split into training (70%, 1,623 videos), validation (10%, 232 videos), internal test (10%, 232 videos), and external test (10%, 232 videos) sets.
Results: On the external test set, the optimized 3D InceptionResNetV2 model trained with dynamic videos outperformed the other four 3D DL models across all metrics: AUROC of 0.916, sensitivity of 0.860, and specificity of 0.834. Its AUROC was significantly higher than that of radiologists (0.916 versus 0.638; p < 0.0001). Additionally, with the assistance of the optimized model, radiologists' diagnostic accuracy improved by 16.9% (junior) and 11.1% (senior) in the external cohort. 3D Grad-CAM further confirmed the model focused on calcified regions (consistent with clinical diagnostic logic) by generating interpretable heatmaps.
Conclusion: A calcification-optimized DL model trained on dynamic ultrasound videos was proposed to efficiently and accurately predict the benign/malignant nature of calcified nodules. This tool shows promises as a non-invasive, interpretable tool for early PTC detection, supporting timely diagnosis and treatment planning.
Keywords: 2D static ultrasound images; Calcification attention module; Deep learning (DL); Papillary thyroid carcinoma (PTC); Thyroid calcified nodules; Ultrasound dynamic video.
© 2025. The Author(s).