Hierarchical Clustering Analyses of Plasma Proteins in Subjects With Cardiovascular Risk Factors Identify Informative Subsets Based on Differential Levels of Angiogenic and Inflammatory Biomarkers

Front Neurosci. 2020 Feb 6:14:84. doi: 10.3389/fnins.2020.00084. eCollection 2020.

Abstract

Agglomerative hierarchical clustering analysis (HCA) is a commonly used unsupervised machine learning approach for identifying informative natural clusters of observations. HCA is performed by calculating a pairwise dissimilarity matrix and then clustering similar observations until all observations are grouped within a cluster. Verifying the empirical clusters produced by HCA is complex and not well studied in biomedical applications. Here, we demonstrate the comparability of a novel HCA technique with one that was used in previous biomedical applications while applying both techniques to plasma angiogenic (FGF, FLT, PIGF, Tie-2, VEGF, VEGF-D) and inflammatory (MMP1, MMP3, MMP9, IL8, TNFα) protein data to identify informative subsets of individuals. Study subjects were diagnosed with mild cognitive impairment due to cerebrovascular disease (MCI-CVD). Through comparison of the two HCA techniques, we were able to identify subsets of individuals, based on differences in VEGF (p < 0.001), MMP1 (p < 0.001), and IL8 (p < 0.001) levels. These profiles provide novel insights into angiogenic and inflammatory pathologies that may contribute to VCID.

Keywords: IL8; MMP1; VEGF; hierarchical clustering analysis; mild cognitive impairment; vascular cognitive impairment and dementia.