Gene signature discovery and systematic validation across diverse clinical cohorts for TB prognosis and response to treatment

PLoS Comput Biol. 2023 Jul 20;19(7):e1010770. doi: 10.1371/journal.pcbi.1010770. eCollection 2023 Jul.

Abstract

While blood gene signatures have shown promise in tuberculosis (TB) diagnosis and treatment monitoring, most signatures derived from a single cohort may be insufficient to capture TB heterogeneity in populations and individuals. Here we report a new generalized approach combining a network-based meta-analysis with machine-learning modeling to leverage the power of heterogeneity among studies. The transcriptome datasets from 57 studies (37 TB and 20 viral infections) across demographics and TB disease states were used for gene signature discovery and model training and validation. The network-based meta-analysis identified a common 45-gene signature specific to active TB disease across studies. Two optimized random forest regression models, using the full or partial 45-gene signature, were then established to model the continuum from Mycobacterium tuberculosis infection to disease and treatment response. In model validation, using pooled multi-cohort datasets to mimic the real-world setting, the model provides robust predictive performance for incipient to active TB risk over a 2.5-year period with an AUROC of 0.85, 74.2% sensitivity, and 78.3% specificity, which approximates the minimum criteria (>75% sensitivity and >75% specificity) within the WHO target product profile for prediction of progression to TB. Moreover, the model strongly discriminates active TB from viral infection (AUROC 0.93, 95% CI 0.91-0.94). For treatment monitoring, the TB scores generated by the model statistically correlate with treatment responses over time and were predictive, even before treatment initiation, of standard treatment clinical outcomes. We demonstrate an end-to-end gene signature model development scheme that considers heterogeneity for TB risk estimation and treatment monitoring.

Publication types

  • Meta-Analysis

MeSH terms

  • Disease Progression
  • Humans
  • Mycobacterium tuberculosis* / genetics
  • Transcriptome / genetics
  • Treatment Outcome
  • Tuberculosis* / diagnosis
  • Tuberculosis* / drug therapy
  • Tuberculosis* / genetics

Grants and funding

The author(s) received no specific funding for this work.