Recurrent repeat expansions in human cancer genomes

Nature. 2023 Jan;613(7942):96-102. doi: 10.1038/s41586-022-05515-1. Epub 2022 Dec 14.


Expansion of a single repetitive DNA sequence, termed a tandem repeat (TR), is known to cause more than 50 diseases [1,2]. However, repeat expansions are often not explored beyond neurological and neurodegenerative disorders. In some cancers, mutations accumulate in short tracts of TRs, a phenomenon termed microsatellite instability; however, larger repeat expansions have not been systematically analysed in cancer [3-8]. Here we identified TR expansions in 2,622 cancer genomes spanning 29 cancer types. In seven cancer types, we found 160 recurrent repeat expansions (rREs), most of which (155/160) were subtype specific. We found that rREs were non-uniformly distributed in the genome with enrichment near candidate cis-regulatory elements, suggesting a potential role in gene regulation. One rRE, a GAAA-repeat expansion, located near a regulatory element in the first intron of UGT2B7 was detected in 34% of renal cell carcinoma samples and was validated by long-read DNA sequencing. Moreover, in preliminary experiments, treating cells that harbour this rRE with a GAAA-targeting molecule led to a dose-dependent decrease in cell proliferation. Overall, our results suggest that rREs may be an important but unexplored source of genetic variation in human cancer, and we provide a comprehensive catalogue for further study.
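
As a toy illustration of what a tandem-repeat tract is, the sketch below counts the longest run of back-to-back copies of a motif (for example GAAA) in a DNA string. It is not the detection method used in the study, which the abstract does not describe; the function name `longest_tandem_run` and the example sequences are hypothetical and made up for illustration.

```python
import re


def longest_tandem_run(sequence: str, motif: str) -> int:
    """Return the largest number of consecutive copies of `motif` in `sequence`.

    Hypothetical helper for illustration only; real repeat-expansion callers
    work on sequencing reads, not on a single assembled string.
    """
    if not motif:
        raise ValueError("motif must be a non-empty string")
    motif = motif.upper()
    # A lookahead lets matches start at every position, so runs that begin
    # inside an earlier partial match are still considered.
    pattern = re.compile(f"(?=((?:{re.escape(motif)})+))")
    runs = [len(m.group(1)) // len(motif) for m in pattern.finditer(sequence.upper())]
    return max(runs, default=0)


# Made-up sequences: a short tract and a longer (expanded) tract of GAAA.
short_tract = "TT" + "GAAA" * 3 + "CC"
long_tract = "TT" + "GAAA" * 12 + "CC"

print(longest_tandem_run(short_tract, "GAAA"))
print(longest_tandem_run(long_tract, "GAAA"))
```

In this simplified picture, an expansion would appear as a larger copy count in one sequence than in a reference sequence at the same locus.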

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Carcinoma, Renal Cell / genetics
  • Carcinoma, Renal Cell / pathology
  • Cell Proliferation / drug effects
  • DNA Repeat Expansion* / genetics
  • Gene Expression Regulation
  • Genome, Human* / genetics
  • Humans
  • Introns / genetics
  • Neoplasms* / classification
  • Neoplasms* / genetics
  • Neoplasms* / pathology
  • Regulatory Elements, Transcriptional / genetics
  • Reproducibility of Results
  • Sequence Analysis, DNA


Substances

  • UGT2B7 protein, human