Automatically classifying the evidence type of drug-drug interaction research papers as a step toward computer supported evidence curation

AMIA Annu Symp Proc. 2021 Jan 25:2020:554-563. eCollection 2020.

Abstract

A longstanding issue with knowledge bases that discuss drug-drug interactions (DDIs) is that they are inconsistent with one another. Computerized support might help experts be more objective in assessing DDI evidence. A requirement for such systems is accurate automatic classification of evidence types. In this pilot study, we developed a hierarchical classifier to classify clinical DDI studies into formally defined evidence types. The area under the ROC curve for sub-classifiers in the ensemble ranged from 0.78 to 0.87. The entire system achieved an F1 of 0.83 and 0.63 on two held-out datasets, the latter consisting focused on completely novel drugs from what the system was trained on. The results suggest that it is feasible to accurately automate the classification of a sub-set of DDI evidence types and that the hierarchical approach shows promise. Future work will test more advanced feature engineering techniques while expanding the system to classify a more complex set of evidence types.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computers
  • Data Mining / methods*
  • Data Mining / statistics & numerical data
  • Databases, Factual* / statistics & numerical data
  • Drug Interactions*
  • Humans
  • Machine Learning*
  • Natural Language Processing
  • Pilot Projects
  • Publications*