Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar 22:10:16.
doi: 10.1186/1472-6947-10-16.

Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes

Affiliations
Free PMC article

Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes

Wei Yu et al. BMC Med Inform Decis Mak. .
Free PMC article

Abstract

Background: We present a potentially useful alternative approach based on support vector machine (SVM) techniques to classify persons with and without common diseases. We illustrate the method to detect persons with diabetes and pre-diabetes in a cross-sectional representative sample of the U.S. population.

Methods: We used data from the 1999-2004 National Health and Nutrition Examination Survey (NHANES) to develop and validate SVM models for two classification schemes: Classification Scheme I (diagnosed or undiagnosed diabetes vs. pre-diabetes or no diabetes) and Classification Scheme II (undiagnosed diabetes or pre-diabetes vs. no diabetes). The SVM models were used to select sets of variables that would yield the best classification of individuals into these diabetes categories.

Results: For Classification Scheme I, the set of diabetes-related variables with the best classification performance included family history, age, race and ethnicity, weight, height, waist circumference, body mass index (BMI), and hypertension. For Classification Scheme II, two additional variables--sex and physical activity--were included. The discriminative abilities of the SVM models for Classification Schemes I and II, according to the area under the receiver operating characteristic (ROC) curve, were 83.5% and 73.2%, respectively. The web-based tool-Diabetes Classifier was developed to demonstrate a user-friendly application that allows for individual or group assessment with a configurable, user-defined threshold.

Conclusions: Support vector machine modeling is a promising classification approach for detecting persons with common diseases such as diabetes and pre-diabetes in the population. This approach should be further explored in other complex diseases using common variables.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Demonstration of finding a separating hyperplane in high dimensional space vs in low dimensional space.
Figure 2
Figure 2
ROC curves for Classifications Schemes I (a) and II (b) with SVM models and logistic regression models. Note: see Table 1 for the definitions of Classification Schemes I and II.

Similar articles

Cited by

References

    1. Cortes C, Vapnik V. Support-vector networks. Machine Learning. 1995;20:273–297.
    1. Ng KL, Mishra SK. De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures. Bioinformatics. 2007;23:1321–1330. doi: 10.1093/bioinformatics/btm026. - DOI - PubMed
    1. Rice SB, Nenadic G, Stapley BJ. Mining protein function from text using term-based support vector machines. BMC Bioinformatics. 2005;6(Suppl 1):S22. doi: 10.1186/1471-2105-6-S1-S22. - DOI - PMC - PubMed
    1. Maglogiannis I, Loukis E, Zafiropoulos E, Stasis A. Support Vectors Machine-based identification of heart valve diseases using heart sounds. Comput Methods Programs Biomed. 2009;95:47–61. doi: 10.1016/j.cmpb.2009.01.003. - DOI - PubMed
    1. Thurston RC, Matthews KA, Hernandez J, De La TF. Improving the performance of physiologic hot flash measures with support vector machines. Psychophysiology. 2009;46:285–292. doi: 10.1111/j.1469-8986.2008.00770.x. - DOI - PMC - PubMed