Bayesian identification of differentially expressed isoforms using a novel joint model of RNA-seq data

PLoS Comput Biol. 2025 Jan 31;21(1):e1012750. doi: 10.1371/journal.pcbi.1012750. eCollection 2025 Jan.

Abstract

We develop a Bayesian approach, BayesIso, to identify differentially expressed isoforms from RNA-seq data. The approach features a novel joint model of the sample variability and the deferential state of isoforms. Specifically, the within-sample variability and the between-sample variability of each isoform are modeled by a Poisson-Lognormal model and a Gamma-Gamma model, respectively. Using a Bayesian framework, the differential state of each isoform and the model parameters are jointly estimated by a Markov Chain Monte Carlo (MCMC) method. Extensive studies using simulation and real data demonstrate that BayesIso can effectively detect isoforms of less differentially expressed and differential transcripts for genes with multiple isoforms. We applied the approach to breast cancer RNA-seq data and uncovered a unique set of isoforms that form key pathways associated with breast cancer recurrence. First, PI3K/AKT/mTOR signaling and PTEN signaling pathways are identified as being involved in breast cancer development. Further integrated with protein-protein interaction data, pathways of Jak-STAT, mTOR, MAPK and Wnt signaling are revealed in association with breast cancer recurrence. Finally, several pathways are activated in the early recurrence of breast cancer. In tumors that occur early, members of pathways of cellular metabolism and cell cycle (such as CD36 and TOP2A) are upregulated, while immune response genes such as NFATC1 are downregulated.

MeSH terms

  • Bayes Theorem
  • Breast Neoplasms / genetics
  • Breast Neoplasms / metabolism
  • Computational Biology / methods
  • Computer Simulation
  • Female
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Markov Chains
  • Monte Carlo Method
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • RNA-Seq* / methods
  • Signal Transduction / genetics

Substances

  • Protein Isoforms