Deep whole genome sequencing identifies recurrent genomic alterations in commonly used breast cancer cell lines and patient-derived xenograft models

Breast Cancer Res. 2022 Sep 24;24(1):63. doi: 10.1186/s13058-022-01540-0.

Abstract

Background: Breast cancer cell lines (BCCLs) and patient-derived xenografts (PDXs) are the most frequently used models in breast cancer research. Despite their widespread usage, genome sequencing of these models is incomplete, with previous studies only focusing on targeted gene panels, whole exome or shallow whole genome sequencing. Deep whole genome sequencing is the most sensitive and accurate method to detect single nucleotide variants and indels, gene copy number and structural events such as gene fusions.

Results: Here we describe deep whole genome sequencing (WGS) of commonly used BCCL and PDX models using the Illumina X10 platform with an average ~ 60 × coverage. We identify novel genomic alterations, including point mutations and genomic rearrangements at base-pair resolution, compared to previously available sequencing data. Through integrative analysis with publicly available functional screening data, we annotate new genomic features likely to be of biological significance. CSMD1, previously identified as a tumor suppressor gene in various cancer types, including head and neck, lung and breast cancers, has been identified with deletion in 50% of our PDX models, suggesting an important role in aggressive breast cancers.

Conclusions: Our WGS data provides a comprehensive genome sequencing resource of these models.

Keywords: Breast cancer cell lines; Non-coding mutations; Patient-derived xenografts; Structural variants; Whole genome sequencing.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Breast Neoplasms* / genetics
  • Disease Models, Animal
  • Female
  • Genomics / methods
  • Heterografts
  • Humans
  • MCF-7 Cells
  • Nucleotides
  • Whole Genome Sequencing

Substances

  • Nucleotides