HER2GAN: Overcome the Scarcity of HER2 Breast Cancer Dataset Based on Transfer Learning and GAN Model

Clin Breast Cancer. 2024 Jan;24(1):53-64. doi: 10.1016/j.clbc.2023.09.014. Epub 2023 Sep 29.

Abstract

Introduction: Immunohistochemistry (IHC) is crucial for breast cancer diagnosis, classification, and individualized treatment. IHC is used to measure the levels of expression of hormone receptors (estrogen and progesterone receptors), human epidermal growth factor receptor 2 (HER2), and other biomarkers, which are used to make treatment decisions and predict how well a patient will do. The evaluation of the breast cancer score on IHC slides, taking into account structural and morphological features as well as a scarcity of relevant data, is one of the most important issues in the IHC debate. Several recent studies have utilized machine learning and deep learning techniques to resolve these issues.

Materials and methods: This paper introduces a new approach for addressing the issue based on supervised deep learning. A GAN-based model is proposed for generating high-quality HER2 images and identifying and classifying HER2 levels. Using transfer learning methodologies, the original and generated images were evaluated.

Results and conclusion: All of the models have been trained and evaluated using publicly accessible and private data sets, respectively. The InceptionV3 and InceptionResNetV2 models achieved a high accuracy of 93% with the combined generated and original images used for training and testing, demonstrating the exceptional quality of the details in the synthesized images.

Keywords: CGAN; Deep learning; HER2; HER2 synthesized images.; Immunohistochemistry.

MeSH terms

  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms* / diagnosis
  • Breast Neoplasms* / metabolism
  • Estrogens
  • Female
  • Humans
  • Machine Learning
  • Receptors, Progesterone / metabolism

Substances

  • Biomarkers, Tumor
  • Receptors, Progesterone
  • Estrogens