Screening for prediabetes using machine learning models

Comput Math Methods Med. 2014:2014:618976. doi: 10.1155/2014/618976. Epub 2014 Jul 16.

Abstract

The global prevalence of diabetes is rapidly increasing. Studies support the need for screening and intervention for prediabetes, which can lead to diabetes and serious complications. This study aimed to develop an intelligence-based screening model for prediabetes. Data from the Korean National Health and Nutrition Examination Survey (KNHANES) were used, excluding subjects with diabetes. The KNHANES 2010 data (n = 4685) were used for training and internal validation, while the KNHANES 2011 data (n = 4566) were used for external validation. We developed two models to screen for prediabetes, one using an artificial neural network (ANN) and one using a support vector machine (SVM), and systematically evaluated them using internal and external validation. We compared the performance of our models with that of a previously developed screening score model for prediabetes based on logistic regression analysis. On the external dataset, the SVM model achieved an area under the curve (AUC) of 0.731, higher than that of the ANN model (0.729) and the screening score model (0.712). The prescreening methods developed in this study performed better than the previously developed screening score model and may be more effective for prediabetes screening.
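The comparison above rests on the area under the ROC curve measured on an external validation set. As a minimal sketch of how such a figure can be computed, the Python below derives the AUC from the Mann-Whitney statistic: the fraction of (positive, negative) pairs in which the positive subject receives the higher score. The function name `roc_auc`, the labels, and the scores are hypothetical and made up for illustration; they are not KNHANES data, and this is not the authors' code.

```python
# Illustrative only: labels and scores below are invented, not KNHANES data.

def roc_auc(labels, scores):
    """Area under the ROC curve via the Mann-Whitney U statistic.

    labels: sequence of 0/1 values (1 = prediabetes).
    scores: model outputs, where a higher score means "more likely positive".
    Tied (positive, negative) pairs count as half a concordant pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")

    concordant = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                concordant += 1.0
            elif p == n:
                concordant += 0.5
    return concordant / (len(positives) * len(negatives))


# Hypothetical external-validation labels and scores from two models.
y_external = [0, 0, 1, 0, 1, 1, 0, 1]
scores_model_a = [0.10, 0.40, 0.35, 0.20, 0.80, 0.70, 0.50, 0.60]
scores_model_b = [0.30, 0.60, 0.40, 0.20, 0.90, 0.50, 0.70, 0.65]

auc_a = roc_auc(y_external, scores_model_a)
auc_b = roc_auc(y_external, scores_model_b)
print(f"model A AUC = {auc_a:.4f}, model B AUC = {auc_b:.4f}")
```

The pairwise loop is quadratic in the number of subjects, which is fine for a small illustration; for datasets of several thousand subjects a rank-based implementation or a library routine would be the more practical choice.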

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Area Under Curve
  • Humans
  • Male
  • Neural Networks, Computer*
  • Prediabetic State / diagnosis*
  • ROC Curve
  • Random Allocation
  • Republic of Korea
  • Risk Factors
  • Support Vector Machine*