Comparison of the predictive qualities of three prognostic models of colorectal cancer

Front Biosci (Elite Ed). 2010 Jun 1;2:849-56. doi: 10.2741/e146.

Abstract

Most discoveries of cancer biomarkers involve construction of a single model to predict survival. 'Data-mining' techniques, such as artificial neural networks (ANNs), perform better than traditional methods, such as logistic regression. In this study, the quality of multiple predictive models built on a molecular data set for colorectal cancer (CRC) was evaluated. Three types of predictive model (logistic regression, ANN, and decision tree) were compared, and the effect of variable-selection techniques on the predictive quality of these models was investigated. The Kolmogorov-Smirnov (KS) statistic was used to compare the models. Overall, the logistic regression and ANN methods outperformed the decision tree. In some instances (e.g., for a model that included 'all variables without tumor stage' and used a decision tree for variable selection), the ANN marginally outperformed logistic regression, although the difference in the KS statistic was minimal (0.80 versus 0.82). Regardless of the variable(s) and the methods for variable selection, all three predictive models identified survivors and non-survivors with the same level of statistical accuracy.
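
As an illustration of how a KS statistic can be used to compare models, the sketch below computes the two-sample KS statistic between the predicted scores a model assigns to survivors and to non-survivors. It assumes the common scoring convention in which KS is the largest gap between the two groups' empirical cumulative distributions; the abstract does not state exactly how the authors computed it. The function name and the scores are hypothetical and are not taken from the study.

```python
from bisect import bisect_right


def ks_statistic(scores_a, scores_b):
    """Two-sample KS statistic: largest gap between two empirical CDFs.

    scores_a and scores_b are model scores for two outcome groups
    (here, hypothetically, survivors and non-survivors).
    Returns a value in [0, 1]; higher means better separation.
    """
    if not scores_a or not scores_b:
        raise ValueError("both groups need at least one score")
    a = sorted(scores_a)
    b = sorted(scores_b)
    max_gap = 0.0
    # The gap can only change at observed score values, so checking those suffices.
    for threshold in set(a) | set(b):
        cdf_a = bisect_right(a, threshold) / len(a)  # share of group a <= threshold
        cdf_b = bisect_right(b, threshold) / len(b)  # share of group b <= threshold
        max_gap = max(max_gap, abs(cdf_a - cdf_b))
    return max_gap


# Hypothetical predicted survival probabilities from one model (illustration only).
survivor_scores = [0.9, 0.8, 0.7, 0.4]
non_survivor_scores = [0.6, 0.3, 0.2, 0.1]
print(ks_statistic(survivor_scores, non_survivor_scores))
```

Under this convention, each candidate model would be scored on the same patients and the model with the larger KS value separates the two outcome groups more cleanly. A value of 1 means the score distributions do not overlap, and 0 means they are identical.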

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Colorectal Neoplasms / metabolism
  • Colorectal Neoplasms / pathology*
  • Follow-Up Studies
  • Humans
  • Immunohistochemistry
  • Logistic Models
  • Models, Biological*
  • Neural Networks, Computer
  • Prognosis