An IR-aided machine learning framework for the BioCreative II.5 Challenge

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):454-61. doi: 10.1109/TCBB.2010.56.


The team at the University of Wisconsin-Milwaukee developed an information retrieval and machine learning framework. Our framework requires only the standardized training data and depends upon minimal external knowledge resources and minimal parsing. Within the framework, we built our text mining systems and participated for the first time in all three BioCreative II.5 Challenge tasks. The results show that our systems performed among the top five teams for raw F1 scores in all three tasks and came in third place for the homonym ortholog F1 scores for the INT task. The results demonstrated that our IR-based framework is efficient, robust, and potentially scalable.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Artificial Intelligence*
  • Computational Biology / methods*
  • Data Mining / methods*
  • Databases, Genetic*
  • Information Storage and Retrieval
  • Protein Interaction Mapping / methods
  • Wisconsin