Development of a deep learning-based automated diagnostic system (DLADS) for classifying mammographic lesions - a first large-scale multi-institutional clinical trial in Japan

Breast Cancer. 2025 Sep;32(5):1115-1124. doi: 10.1007/s12282-025-01741-3. Epub 2025 Jul 3.

Abstract

Background: Recently, western countries have built evidence on mammographic artificial Intelligence-computer-aided diagnosis (AI-CADx) systems; however, their effectiveness has not yet been sufficiently validated in Japanese women. In this study, we aimed to establish a Japanese mammographic AI-CADx system for the first time.

Methods: We retrospectively collected screening or diagnostic mammograms from 63 institutions in Japan. We then randomly divided the images into training, validation, and test datasets in a balanced ratio of 8:1:1 on a case-level basis. The gold standard of annotation for the AI-CADx system is mammographic findings based on pathologic references. The AI-CADx system was developed using SE-ResNet modules and a sliding window algorithm. A cut-off concentration gradient of the heatmap image was set at 15%. The AI-CADx system was considered accurate if it detected the presence of a malignant lesion in a breast cancer mammogram. The primary endpoint of the AI-CADx system was defined as a sensitivity and specificity of over 80% for breast cancer diagnosis in the test dataset.

Results: We collected 20,638 mammograms from 11,450 Japanese women with a median age of 55 years. The mammograms included 5019 breast cancer (24.3%), 5026 benign (24.4%), and 10,593 normal (51.3%) mammograms. In the test dataset of 2059 mammograms, the AI-CADx system achieved a sensitivity of 83.5% and a specificity of 84.7% for breast cancer diagnosis. The AUC in the test dataset was 0.841 (DeLong 95% CI; 0.822-0.859). The Accuracy was almost consistent independent of breast density, mammographic findings, type of cancer, and mammography vendors (AUC (range); 0.639-0.906).

Conclusions: The developed Japanese mammographic AI-CADx system diagnosed breast cancer with a pre-specified sensitivity and specificity. We are planning a prospective study to validate the breast cancer diagnostic performance of Japanese physicians using this AI-CADx system as a second reader.

Trial registration: UMIN, trial number UMIN000039009. Registered 26 December 2019, https://www.umin.ac.jp/ctr/.

Keywords: Artificial intelligence; Breast cancer; Computer-aided detection; Computer-aided diagnosis; Convolutional neural network; Mammography.

Publication types

  • Multicenter Study

MeSH terms

  • Adult
  • Aged
  • Breast Neoplasms* / classification
  • Breast Neoplasms* / diagnosis
  • Breast Neoplasms* / diagnostic imaging
  • Breast Neoplasms* / pathology
  • Deep Learning*
  • Diagnosis, Computer-Assisted* / methods
  • Early Detection of Cancer / methods
  • Female
  • Humans
  • Japan
  • Mammography* / methods
  • Middle Aged
  • Retrospective Studies
  • Sensitivity and Specificity