Open data mining for Taiwan's dengue epidemic

Acta Trop. 2018 Jul:183:1-7. doi: 10.1016/j.actatropica.2018.03.017. Epub 2018 Mar 13.

Abstract

By using a quantitative approach, this study examines the applicability of data mining technique to discover knowledge from open data related to Taiwan's dengue epidemic. We compare results when Google trend data are included or excluded. Data sources are government open data, climate data, and Google trend data. Research findings from analysis of 70,914 cases are obtained. Location and time (month) in open data show the highest classification power followed by climate variables (temperature and humidity), whereas gender and age show the lowest values. Both prediction accuracy and simplicity decrease when Google trends are considered (respectively 0.94 and 0.37, compared to 0.96 and 0.46). The article demonstrates the value of open data mining in the context of public health care.

Keywords: Data mining; Dengue epidemic; Google trend; Open data; Simplicity.

Publication types

  • Review

MeSH terms

  • Climate
  • Data Mining*
  • Dengue / epidemiology*
  • Epidemics / statistics & numerical data*
  • Evaluation Studies as Topic
  • Humans
  • Public Health*
  • Taiwan / epidemiology
  • Temperature