Tissue-specific transfer learning improves functional variant and therapeutic target discoveries in breast and prostate cancer

PLoS Genet. 2026 May 6;22(5):e1012145. doi: 10.1371/journal.pgen.1012145. eCollection 2026 May.

Abstract

DNA foundation models trained on large-scale genomic and epigenetic datasets have shown promise for regulatory variant interpretation, yet their application to tissue-specific contexts remain limited. Here, we present a transfer learning (TL) framework to adapt Enformer, a deep neural network trained on 5,313 multi-omics tracks, to breast and prostate cancer using 275 and 357 tissue-specific transcription factor (TF) ChIP-seq tracks, respectively. We computed tissue-specific cis-regulatory activity (tCRA) scores for millions of single-nucleotide variants (SNVs) in genome-wide association study (GWAS) datasets and prioritized high-impact SNV subsets (1M, 1.5M, and 2M). These TL-prioritized variants demonstrated consistently greater enrichment in tissue-specific enhancers, cancer GWAS risk variants, and ClinVar pathogenic variants compared to the original Enformer model. Transcriptome-wide association studies (TWAS) using TL-based SNVs identified more cancer-relevant genes, many of which exhibited functional essentiality (DepMap), therapeutic tractability (drug databases), and disease relevance (DisGeNET). Notably, TL models outperformed the base model in identifying genes enriched for drug targets and clinically relevant disease associations. Our results show that TL-derived tCRA scores enhance regulatory variant prioritization and improve susceptibility gene discovery in a tissue-specific manner. Our study provides a generalizable framework for tailoring foundation models to disease-relevant contexts, with implications for variant interpretation, therapeutic target discovery, and precision medicine.

MeSH terms

  • Breast Neoplasms* / drug therapy
  • Breast Neoplasms* / genetics
  • Breast Neoplasms* / pathology
  • Female
  • Gene Expression Regulation, Neoplastic
  • Genome-Wide Association Study / methods
  • Humans
  • Male
  • Organ Specificity / genetics
  • Polymorphism, Single Nucleotide / genetics
  • Prostatic Neoplasms* / drug therapy
  • Prostatic Neoplasms* / genetics
  • Prostatic Neoplasms* / pathology
  • Transcription Factors / genetics
  • Transcriptome / genetics

Substances

  • Transcription Factors