Explanatory Approach for Evaluation of Machine Learning-Induced Knowledge

J Int Med Res. Sep-Oct 2009;37(5):1543-51. doi: 10.1177/147323000903700532.


Progress in biomedical research has resulted in an explosive growth of data. Use of the world wide web for sharing data has opened up possibilities for exhaustive data mining analysis. Symbolic machine learning approaches used in data mining, especially ensemble approaches, produce large sets of patterns that need to be evaluated. Manual evaluation of all patterns by a human expert is almost impossible. We propose a new approach to the evaluation of machine learning-induced knowledge by introducing a pre-evaluation step. Pre-evaluation is the automatic evaluation of patterns obtained from the data mining phase, using text mining techniques and sentiment analysis. It is used as a filter for patterns according to the support found in online resources, such as publicly-available repositories of scientific papers and reports related to the problem. The domain expert can then more easily distinguish between patterns or rules that are potential candidates for new knowledge.

Publication types

  • Review

MeSH terms

  • Artificial Intelligence*
  • Computer-Assisted Instruction*
  • Data Mining
  • Humans
  • Knowledge*