Stool microbiota are superior to saliva in distinguishing cirrhosis and hepatic encephalopathy using machine learning

J Hepatol. 2022 Mar;76(3):600-607. doi: 10.1016/j.jhep.2021.11.011. Epub 2021 Nov 15.

Abstract

Background & aims: Saliva and stool microbiota are altered in cirrhosis. Since stool is logistically more difficult to collect than saliva, it is important to determine their relative diagnostic and prognostic capabilities. We aimed to determine the ability of stool vs. saliva microbiota to differentiate between groups based on disease severity using machine learning (ML).

Methods: Controls and outpatients with cirrhosis underwent saliva and stool microbiome analysis. Controls vs. patients with cirrhosis, and subgroups within cirrhosis (defined by hepatic encephalopathy [HE], proton pump inhibitor [PPI] use, and rifaximin use), were classified using 4 ML techniques (random forest [RF], support vector machine, logistic regression, and gradient boosting), with AUCs compared for stool, saliva, or both sample types. Individual microbial contributions were computed using RF feature importance and Shapley additive explanations. Finally, thresholds for including microbiota were varied between 2.5% and 10%, and core microbiome (DESeq2) analysis was performed.
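
The AUC comparison described in the Methods can be illustrated with a minimal sketch. It assumes each classifier has already produced a score per participant and computes the AUC as the probability that a randomly chosen patient scores higher than a randomly chosen control (the Mann-Whitney formulation). The function name `roc_auc` and all scores below are hypothetical illustrations, not study data or the authors' code.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the chance that a random positive outscores a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical held-out predictions (1 = cirrhosis, 0 = control); not study data.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
stool_scores = [0.9, 0.8, 0.7, 0.4, 0.5, 0.3, 0.2, 0.1]
saliva_scores = [0.6, 0.4, 0.7, 0.3, 0.5, 0.6, 0.2, 0.1]

stool_auc = roc_auc(labels, stool_scores)
saliva_auc = roc_auc(labels, saliva_scores)
print(f"stool AUC = {stool_auc:.3f}, saliva AUC = {saliva_auc:.3f}")
```

The pairwise loop is quadratic in sample size, which is acceptable for a cohort of a few hundred participants. Testing whether two AUCs differ significantly needs an additional procedure that this sketch does not cover.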

Results: Two hundred and sixty-nine participants were included: 87 controls and 182 patients with cirrhosis, of whom 57 had HE, 78 were on PPIs, and 29 were on rifaximin. Regardless of the ML model, stool microbiota had a significantly higher AUC than saliva microbiota in differentiating groups. Regarding individual microbiota: autochthonous taxa drove the difference between controls and patients with cirrhosis, oral-origin microbiota the difference between PPI users and non-users, and pathobionts and autochthonous taxa the difference between rifaximin users and non-users and between patients with and without HE. These findings were consistent with the core microbiome analysis results.
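
The per-taxon contributions reported above rest in part on Shapley additive explanations. As a rough illustration of the underlying idea, the sketch below computes exact Shapley values for one sample by enumerating every feature coalition, with absent features held at a baseline value. This is not the SHAP library and not the authors' pipeline: the toy scoring function, the three feature names, and all abundances are hypothetical assumptions chosen only to keep the example small.

```python
from itertools import combinations
from math import factorial
from typing import Callable, Dict, List, Set


def shapley_values(
    predict: Callable[[Dict[str, float]], float],
    sample: Dict[str, float],
    baseline: Dict[str, float],
) -> Dict[str, float]:
    """Exact Shapley values for one sample relative to a single baseline."""
    features: List[str] = list(sample)
    n = len(features)

    def value(coalition: Set[str]) -> float:
        # Features in the coalition take the sample's value; the rest stay at baseline.
        x = {f: (sample[f] if f in coalition else baseline[f]) for f in features}
        return predict(x)

    phi: Dict[str, float] = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k in the Shapley average.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                total += weight * (value(s | {f}) - value(s))
        phi[f] = total
    return phi


def toy_score(x: Dict[str, float]) -> float:
    # Hypothetical model: higher score means "more cirrhosis-like".
    return (0.5 - 2.0 * x["autochthonous"] + 1.5 * x["pathobiont"]
            + 4.0 * x["oral_origin"] * x["pathobiont"])


# Hypothetical relative abundances; not study data.
sample = {"autochthonous": 0.10, "pathobiont": 0.30, "oral_origin": 0.20}
baseline = {"autochthonous": 0.40, "pathobiont": 0.10, "oral_origin": 0.05}

phi = shapley_values(toy_score, sample, baseline)
for name, contribution in phi.items():
    print(f"{name}: {contribution:+.3f}")
```

The contributions sum to the difference between the sample's score and the baseline score, which is the property that makes Shapley values attractive for attributing a prediction to individual taxa. Full enumeration grows exponentially with the number of features, so real microbiome models rely on approximations or model-specific algorithms instead.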

Conclusions: On ML analysis, stool microbiota composition is significantly more informative than saliva in differentiating between controls and patients with cirrhosis, and between patients with varying cirrhosis severity. Despite logistical challenges, stool should be preferred over saliva for microbiome analysis.

Lay summary: Since it is harder to collect stool than saliva, we wanted to test whether microbes from saliva were better than those from stool in differentiating between healthy people and those with cirrhosis and, among those with cirrhosis, in identifying those with more severe disease. Using machine learning, we found that microbes in stool were more accurate than saliva alone or in combination. Therefore, stool should be preferred for collection and analysis wherever possible.

Keywords: Machine Learning; Proton pump inhibitors; Random Forest classifier; Rifaximin; SHAP.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Aged
  • Feces / microbiology*
  • Female
  • Hepatic Encephalopathy / diagnosis*
  • Hepatic Encephalopathy / physiopathology
  • Humans
  • Liver Cirrhosis / diagnosis*
  • Liver Cirrhosis / physiopathology
  • Machine Learning / standards
  • Machine Learning / statistics & numerical data
  • Male
  • Mass Screening / methods
  • Mass Screening / standards*
  • Mass Screening / statistics & numerical data
  • Microbiota / physiology
  • Middle Aged
  • Prognosis
  • Saliva / microbiology*