Comprehensive analysis of pathways in Coronavirus 2019 (COVID-19) using an unsupervised machine learning method

Appl Soft Comput. 2022 Oct:128:109510. doi: 10.1016/j.asoc.2022.109510. Epub 2022 Aug 17.

Abstract

The World Health Organization (WHO) introduced "Coronavirus disease 19" or "COVID-19" as a novel coronavirus in March 2020. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires the fast discovery of effective treatments to fight this worldwide crisis. Artificial intelligence and bioinformatics analysis pipelines can assist with finding biomarkers, explanations, and cures. Artificial intelligence and machine learning methods provide powerful infrastructures for interpreting and understanding the available data. On the other hand, pathway enrichment analysis, as a dominant tool, could help researchers discover potential key targets present in biological pathways of host cells that are targeted by SARS-CoV-2. In this work, we propose a two-stage machine learning approach for pathway analysis. During the first stage, four informative gene sets that can represent important COVID-19 related pathways are selected. These "representative genes" are associated with the COVID-19 pathology. Then, two distinctive networks were constructed for COVID-19 related signaling and disease pathways. In the second stage, the pathways of each network are ranked with respect to some unsupervised scorning method based on our defined informative features. Finally, we present a comprehensive analysis of the top important pathways in both networks. Materials and implementations are available at: https://github.com/MahnazHabibi/Pathway.

Keywords: Coronavirus disease 2019; Machine learning; SARS-CoV-2; Unsupervised learning.