Integration of gene co-expression analysis and multi-class SVM specifies the functional players involved in determining the fate of HTLV-1 infection toward the development of cancer (ATLL) or neurological disorder (HAM/TSP)

PLoS One. 2022 Jan 18;17(1):e0262739. doi: 10.1371/journal.pone.0262739. eCollection 2022.

Abstract

Human T-cell Leukemia Virus type-1 (HTLV-1) is an oncovirus that may cause two main life-threatening diseases including a cancer type named Adult T-cell Leukemia/Lymphoma (ATLL) and a neurological and immune disturbance known as HTLV-1 Associated Myelopathy/Tropical Spastic Paraparesis (HAM/TSP). However, a large number of the infected subjects remain as asymptomatic carriers (ACs). There is no comprehensive study that determines which dysregulated genes differentiate the pathogenesis routes toward ATLL or HAM/TSP. Therefore, two main algorithms including weighted gene co-expression analysis (WGCNA) and multi-class support vector machines (SVM) were utilized to find major gene players in each condition. WGCNA was used to find the highly co-regulated genes and multi-class SVM was employed to identify the most important classifier genes. The identified modules from WGCNA were validated in the external datasets. Furthermore, to find specific modules for ATLL and HAM/TSP, the non-preserved modules in another condition were found. In the next step, a model was constructed by multi-class SVM. The results revealed 467, 3249, and 716 classifiers for ACs, ATLL, and HAM/TSP, respectively. Eventually, the common genes between the WGCNA results and classifier genes resulted from multi-class SVM that also determined as differentially expressed genes, were identified. Through these step-wise analyses, PAIP1, BCAS2, COPS2, CTNNB1, FASLG, GTPBP1, HNRNPA1, RBBP6, TOP1, SLC9A1, JMY, PABPC3, and PBX1 were found as the possible critical genes involved in the progression of ATLL. Moreover, FBXO9, ZNF526, ERCC8, WDR5, and XRCC3 were identified as the conceivable major involved genes in the development of HAM/TSP. These genes can be proposed as specific biomarker candidates and therapeutic targets for each disease.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Gene Expression Profiling
  • Gene Expression Regulation*
  • Genetic Markers*
  • HTLV-I Infections / complications*
  • HTLV-I Infections / genetics
  • HTLV-I Infections / metabolism
  • HTLV-I Infections / virology
  • Human T-lymphotropic virus 1 / genetics*
  • Humans
  • Leukemia-Lymphoma, Adult T-Cell / etiology
  • Leukemia-Lymphoma, Adult T-Cell / metabolism
  • Leukemia-Lymphoma, Adult T-Cell / pathology*
  • Nervous System Diseases / etiology
  • Nervous System Diseases / metabolism
  • Nervous System Diseases / pathology*
  • Support Vector Machine*

Substances

  • Genetic Markers

Grants and funding

The authors received financial support from the Iran National Science Foundation (INSF). The authors also acknowledge the University of Isfahan for supporting this research through the postdoctoral program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.