Predicting the incidence risk of ischemic stroke in a hospital population of southern China: a classification tree analysis

J Neurol Sci. 2011 Jul 15;306(1-2):108-14. doi: 10.1016/j.jns.2011.03.032. Epub 2011 Apr 13.


Objective: To determine the major risk factors and their interactions of ischemic stroke (IS) and to develop a classification tree model to predict the incidence risk of IS for a Chinese population.

Methods: Exhaustive Chi-squared Automatic Interaction Detection (Exhaustive CHAID) algorithm of classification tree method was applied to build a prediction model for the incidence risk of IS under the design of 1:1 matched case-control study. The statistics of misclassification risk was used to evaluate the fitness of the model.

Results: In the prediction model, six variables of physical exercise, history of hypertension, tea drinking, HDL-c level, smoking status and educational level were in turn selected as the predictors of IS incidence risk. In the subgroup of lacking of physical exercise, individuals who had history of hypertension would have a significantly higher IS risk (92%) than that of the ones who had no history of hypertension (64%). The misclassification risk estimate of the prediction model was 0.21 with the standard error of 0.02, indicating that 79% of the cases could be classified correctly based on current prediction model.

Conclusions: Lacking of physical exercise and history of hypertension are identified to be the prominent predicting variables of IS risk for a hospital population of southern China. Although CHAID analysis could provide detailed information and insight about interactions among risk factors of IS, we still need to validate our model and improve the vascular risk prediction for Chinese subjects in further studies.

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Brain Ischemia / complications
  • Brain Ischemia / epidemiology*
  • China / epidemiology
  • Decision Trees*
  • Female
  • Hospitals / statistics & numerical data
  • Humans
  • Incidence
  • Male
  • Middle Aged
  • Models, Statistical
  • Predictive Value of Tests
  • Retrospective Studies
  • Risk Factors
  • Stroke / epidemiology*
  • Stroke / etiology