Machine learning approaches to decipher hormone and HER2 receptor status phenotypes in breast cancer

Brief Bioinform. 2019 Mar 22;20(2):504-514. doi: 10.1093/bib/bbx138.

Abstract

Breast cancer prognosis and administration of therapies are aided by knowledge of hormonal and HER2 receptor status. Breast cancer lacking estrogen receptors, progesterone receptors and HER2 receptors are difficult to treat. Regarding large data repositories such as The Cancer Genome Atlas, available wet-lab methods for establishing the presence of these receptors do not always conclusively cover all available samples. To this end, we introduce median-supplement methods to identify hormonal and HER2 receptor status phenotypes of breast cancer patients using gene expression profiles. In these approaches, supplementary instances based on median patient gene expression are introduced to balance a training set from which we build simple models to identify the receptor expression status of patients. In addition, for the purpose of benchmarking, we examine major machine learning approaches that are also applicable to the problem of finding receptor status in breast cancer. We show that our methods are robust and have high sensitivity with extremely low false-positive rates compared with the well-established methods. A successful application of these methods will permit the simultaneous study of large collections of samples of breast cancer patients as well as save time and cost while standardizing interpretation of outcomes of such studies.

Keywords: HER2 receptor status; breast cancer; classification; hormone receptor status; machine learning.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms / classification*
  • Breast Neoplasms / genetics
  • Breast Neoplasms / metabolism*
  • Computational Biology / methods
  • Female
  • Gene Expression Profiling / methods*
  • Humans
  • Machine Learning*
  • Phenotype
  • Prognosis
  • Receptor, ErbB-2 / metabolism*
  • Receptors, Estrogen / metabolism
  • Receptors, Progesterone / metabolism

Substances

  • Biomarkers, Tumor
  • Receptors, Estrogen
  • Receptors, Progesterone
  • ERBB2 protein, human
  • Receptor, ErbB-2