Microarray Data Mining and Preliminary Bioinformatics Analysis of Hepatitis D Virus-Associated Hepatocellular Carcinoma

Biomed Res Int. 2021 Jan 30:2021:1093702. doi: 10.1155/2021/1093702. eCollection 2021.

Abstract

Several studies have demonstrated that chronic hepatitis delta virus (HDV) infection is associated with a worsening of hepatitis B virus (HBV) infection and increased risk of hepatocellular carcinoma (HCC). However, there is limited data on the role of HDV in the oncogenesis of HCC. This study is aimed at assessing the potential mechanisms of HDV-associated hepatocarcinogenesis, especially to screen and identify key genes and pathways possibly involved in the pathogenesis of HCC. We selected three microarray datasets: GSE55092 contains 39 cancer specimens and 81 paracancer specimens from 11 HBV-associated HCC patients, GSE98383 contains 11 cancer specimens and 24 paracancer specimens from 5 HDV-associated HCC patients, and 371 HCC patients with the RNA-sequencing data combined with their clinical data from the Cancer Genome Atlas (TCGA). Afterwards, 948 differentially expressed genes (DEGs) closely related to HDV-associated HCC were obtained using the R package and filtering with a Venn diagram. We then performed gene ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis to determine the biological processes (BP), cellular component (CC), molecular function (MF), and KEGG signaling pathways most enriched for DEGs. Additionally, we performed Weighted Gene Coexpression Network Analysis (WGCNA) and protein-to-protein interaction (PPI) network construction with 948 DEGs, from which one module was identified by WGCNA and three modules were identified by the PPI network. Subsequently, we validated the expression of 52 hub genes from the PPI network with an independent set of HCC dataset stored in the Gene Expression Profiling Interactive Analysis (GEPIA) database. Finally, seven potential key genes were identified by intersecting with key modules from WGCNA, including 3 reported genes, namely, CDCA5, CENPH, and MCM7, and 4 novel genes, namely, CDC6, CDC45, CDCA8, and MCM4, which are associated with nucleoplasm, cell cycle, DNA replication, and mitotic cell cycle. The CDCA8 and stage of HCC were the independent factors associated with overall survival of HDV-associated HCC. All the related findings of these genes can help gain a better understanding of the role of HDV in the underlying mechanism of HCC carcinogenesis.

MeSH terms

  • Biomarkers, Tumor / genetics
  • Carcinoma, Hepatocellular / complications
  • Carcinoma, Hepatocellular / genetics*
  • Carcinoma, Hepatocellular / pathology
  • Carcinoma, Hepatocellular / virology
  • Computational Biology
  • Data Mining / statistics & numerical data
  • Gene Expression Regulation, Neoplastic / genetics
  • Hepatitis B / complications
  • Hepatitis B / genetics*
  • Hepatitis B / pathology
  • Hepatitis Delta Virus / genetics
  • Hepatitis Delta Virus / pathogenicity
  • Humans
  • Liver Neoplasms / complications
  • Liver Neoplasms / genetics*
  • Liver Neoplasms / pathology
  • Liver Neoplasms / virology
  • Microarray Analysis / statistics & numerical data
  • Neoplasm Proteins / genetics*
  • Protein Interaction Maps / genetics

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins