Hierarchical classification of large-scale patient records for automatic treatment stratification

IEEE J Biomed Health Inform. 2015 Jul;19(4):1234-45. doi: 10.1109/JBHI.2015.2414876. Epub 2015 Mar 19.

Abstract

In this paper, a hierarchical learning algorithm is developed for classifying large-scale patient records, e.g., categorizing large-scale patient records into large numbers of known patient categories (i.e., thousands of known patient categories) for automatic treatment stratification. Our hierarchical learning algorithm can leverage tree structure to train more discriminative max-margin classifiers for high-level nodes and control interlevel error propagation effectively. By ruling out unlikely groups of patient categories (i.e., irrelevant high-level nodes) at an early stage, our hierarchical approach can achieve log-linear computational complexity, which is very attractive for big data applications. Our experiments on one specific medical domain have demonstrated that our hierarchical approach can achieve very competitive results on both classification accuracy and computational efficiency as compared with other state-of-the-art techniques.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Decision Trees
  • Electronic Health Records / classification*
  • Humans
  • Medical Informatics Computing*
  • Support Vector Machine*