Identification of Hub genes with prognostic values in colorectal cancer by integrated bioinformatics analysis

Cancer Biomark. 2024 Feb 1. doi: 10.3233/CBM-230113. Online ahead of print.

Abstract

Background: Our study aimed to investigate the Hub genes and their prognostic value in colorectal cancer (CRC) via bioinformatics analysis.

Methods: The data set of colorectal cancer was downloaded from the GEO database (GSE21510, GSE110224 and GSE74602) for differential expression analysis using the GEO2R tool. Hub genes were screened by protein-protein interaction (PPI) comprehensive analysis. GEPIA was used to verify the expression of Hub genes and evaluate its prognostic value. The protein expression of Hub gene in CRC was analyzed using the Human Protein Atlas database. The cBioPortal was used to analyze the type and frequency of Hub gene mutations, and the effects of mutation on the patients' prognosis. The TIMER database was used to study the correlation between Hub genes and immune infiltration in CRC. Gene set enrichment analysis (GSEA) was used to explore the biological function and signal pathway of the Hub genes and corresponding co-expressed genes.

Results: We identified 346 differentially expressed genes (DEGs), including 117 upregulated and 229 downregulated. Four Hub genes (AURKA, CCNB1, EXO1 and CCNA2) were selected by survival analysis and differential expression validation. The protein and mRNA expression levels of AURKA, CCNB1, EXO1 and CCNA2 were higher in CRC tissues than in adjacent tissues. There were varying degrees of immune cell infiltration and gene mutation of Hub genes, especially B cells and CD8+ T cells. The results of GSEA showed that Hub genes and their co-expressed genes mainly participated in chromosome segregation, DNA replication, translational elongation and cell cycle.

Conclusion: Overexpression of AURKA, CCNB1, CCNA2 and EXO1 had a better prognosis for CRC and this effect was correlation with gene mutation and infiltration of immune cells.

Keywords: Colorectal cancer; Hub genes; bioinformatics analysis; prognosis.