Integrated analysis of single-cell RNA-seq dataset and bulk RNA-seq dataset constructs a prognostic model for predicting survival in human glioblastoma

Brain Behav. 2022 May;12(5):e2575. doi: 10.1002/brb3.2575. Epub 2022 Apr 16.

Abstract

Background: Glioblastoma (GBM) is the most common primary malignant brain tumor in adults. For patients with GBM, the median overall survival (OS) is 14.6 months and the 5-year survival rate is 7.2%. It is imperative to develop a reliable model to predict the survival probability in new GBM patients. To date, most prognostic models for predicting survival in GBM were constructed based on bulk RNA-seq dataset, which failed to accurately reflect the difference between tumor cores and peripheral regions, and thus show low predictive capability. An effective prognostic model is desperately needed in clinical practice.

Methods: We studied single-cell RNA-seq dataset and The Cancer Genome Atlas-glioblastoma multiforme (TCGA-GBM) dataset to identify differentially expressed genes (DEGs) that impact the OS of GBM patients. We then applied the least absolute shrinkage and selection operator (LASSO) Cox penalized regression analysis to determine the optimal genes to be included in our risk score prognostic model. Then, we used another dataset to test the accuracy of our risk score prognostic model.

Results: We identified 2128 DEGs from the single-cell RNA-seq dataset and 6461 DEGs from the bulk RNA-seq dataset. In addition, 896 DEGs associated with the OS of GBM patients were obtained. Five of these genes (LITAF, MTHFD2, NRXN3, OSMR, and RUFY2) were selected to generate a risk score prognostic model. Using training and validation datasets, we found that patients in the low-risk group showed better OS than those in the high-risk group. We validated our risk score model with the training and validating datasets and demonstrated that it can effectively predict the OS of GBM patients.

Conclusion: We constructed a novel prognostic model to predict survival in GBM patients by integrating a scRNA-seq dataset and a bulk RNA-seq dataset. Our findings may advance the development of new therapeutic targets and improve clinical outcomes for GBM patients.

Keywords: bulk RNA-seq; glioblastoma; overall survival; prognostic model; single-cell RNA-seq.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Brain Neoplasms* / genetics
  • Gene Expression Regulation, Neoplastic
  • Glioblastoma* / genetics
  • Humans
  • Prognosis
  • RNA-Seq