R code and downstream analysis objects for the scRNA-seq atlas of normal and tumorigenic human breast tissue

Sci Data. 2022 Mar 23;9(1):96. doi: 10.1038/s41597-022-01236-2.

Abstract

Breast cancer is a common and highly heterogeneous disease. Understanding cellular diversity in the mammary gland and its surrounding micro-environment across different states can provide insight into cancer development in the human breast. Recently, we published a large-scale single-cell RNA expression atlas of the human breast spanning normal, preneoplastic and tumorigenic states. Single-cell expression profiles of nearly 430,000 cells were obtained from 69 distinct surgical tissue specimens from 55 patients. This article extends the study by providing quality filtering thresholds, downstream processed R data objects, complete cell annotation and R code to reproduce all the analyses. Data quality assessment measures are presented and details are provided for all the bioinformatic analyses that produced results described in the study.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms* / genetics
  • Computational Biology
  • Datasets as Topic
  • Exome Sequencing
  • Female
  • Gene Expression Profiling
  • Humans
  • Sequence Analysis, RNA*
  • Single-Cell Analysis*
  • Tumor Microenvironment