Cluster analysis categorizes five phenotypes of pulmonary tuberculosis

Sci Rep. 2022 Jun 16;12(1):10084. doi: 10.1038/s41598-022-13526-1.

Abstract

Tuberculosis (TB) has a heterogeneous phenotype, which makes it challenging to diagnose. Our study aimed to identify TB phenotypes through cluster analysis and compare their initial symptomatic, microbiological and radiographic characteristics. We systemically collected data of notified TB patients notified in Korea and constructed a prospective, observational cohort database. Cluster analysis was performed using K-means clustering, and the variables to be included were determined by correlation network. A total of 4,370 subjects with pulmonary TB were enrolled in the study. Based on the correlation network, age and body mass index (BMI) were selected for the cluster analysis. Five clusters were identified and characterised as follows: (1) middle-aged overweight male dominance, (2) young-aged relatively female dominance without comorbidities, (3) middle-aged underweight male dominance, (4) overweight elderly with comorbidities and (5) underweight elderly with comorbidities. All clusters had distinct demographic and symptomatic characteristics. Initial microbiologic burdens and radiographic features also varied, including the presence of cavities and bilateral infiltration, which reflect TB-related severity. Cluster analysis of age and BMI identified five phenotypes of pulmonary TB with significant differences at initial clinical presentations. Further studies are necessary to validate our results and to assess their clinical implications.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Cluster Analysis
  • Female
  • Humans
  • Male
  • Middle Aged
  • Overweight
  • Phenotype
  • Prospective Studies
  • Risk Factors
  • Thinness
  • Tuberculosis*
  • Tuberculosis, Pulmonary* / diagnostic imaging
  • Tuberculosis, Pulmonary* / epidemiology
  • Tuberculosis, Pulmonary* / microbiology