Understanding the host-pathogen evolutionary balance through Gaussian process modeling of SARS-CoV-2

Patterns (N Y). 2023 Jul 21;4(8):100800. doi: 10.1016/j.patter.2023.100800. eCollection 2023 Aug 11.

Abstract

We have developed a machine learning (ML) approach using Gaussian process (GP)-based spatial covariance (SCV) to track the impact of spatial-temporal mutational events driving host-pathogen balance in biology. We show how SCV can be applied to understanding the response of evolving covariant relationships linking the variant pattern of virus spread to pathology for the entire SARS-CoV-2 genome on a daily basis. We show that GP-based SCV relationships in conjunction with genome-wide co-occurrence analysis provides an early warning anomaly detection (EWAD) system for the emergence of variants of concern (VOCs). EWAD can anticipate changes in the pattern of performance of spread and pathology weeks in advance, identifying signatures destined to become VOCs. GP-based analyses of variation across entire viral genomes can be used to monitor micro and macro features responsible for host-pathogen balance. The versatility of GP-based SCV defines starting point for understanding nature's evolutionary path to complexity through natural selection.

Keywords: Gaussian processes; SARS-CoV-2; early warning; evolution; genomic surveillance; host-pathogen; machine learning; spatial covariance.