Mining Social Media to Identify Heat Waves

Int J Environ Res Public Health. 2019 Mar 2;16(5):762. doi: 10.3390/ijerph16050762.


Heat waves are among the deadliest natural hazards, and their frequency and intensity will likely increase as the climate continues to warm. A challenge in studying these phenomena is the lack of a universally accepted quantitative definition that captures both temperature anomalies and associated mortality. We test the hypothesis that social media mining can be used to identify heat wave mortality. Applying the approach to India, we find that the number of heat-related tweets correlates with heat-related mortality much better than traditional climate-based indicators do, especially at larger scales; the climate-based indicators identify many heat wave days that do not lead to excess mortality. We conclude that social media-based heat wave identification can complement climatic data and can be used to (1) study heat wave impacts at large scales or in developing countries, where mortality data are difficult to obtain and uncertain, and (2) track dangerous heat wave events in real time.

Keywords: Twitter mining; heatwave; heatwave definition; social media.
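
The comparison described in the abstract (tweet counts versus a climate-based indicator, each correlated against mortality) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the use of the Pearson coefficient, the fixed temperature threshold standing in for a climate-based indicator, the helper names `pearson` and `heat_wave_flags`, and all numbers, which are made-up toy values chosen so that the threshold flags several days with few deaths.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation undefined for a constant series")
    return cov / sqrt(vx * vy)


def heat_wave_flags(tmax_c, threshold_c):
    """Return 1 for days whose maximum temperature meets the threshold, else 0.

    A fixed threshold is only a stand-in for a climate-based indicator;
    the indicators used in the study may be defined differently.
    """
    return [1 if t >= threshold_c else 0 for t in tmax_c]


# Made-up toy values for illustration only; NOT data from the study.
tweets = [3, 5, 4, 40, 55, 6, 4]        # daily heat-related tweet counts
deaths = [0, 1, 0, 12, 20, 1, 0]        # daily heat-related deaths
tmax_c = [41, 43, 42, 44, 45, 43, 40]   # daily maximum temperature (deg C)

flags = heat_wave_flags(tmax_c, threshold_c=42)  # hypothetical threshold
r_tweets = pearson(tweets, deaths)
r_flags = pearson(flags, deaths)

print(f"tweets vs deaths:          r = {r_tweets:.2f}")
print(f"threshold flags vs deaths: r = {r_flags:.2f}")
```

In this toy series the threshold marks five of seven days as heat wave days although deaths are concentrated in two of them, so the binary flag tracks mortality less closely than the tweet count does. Real data would need to be aggregated to a common spatial and temporal scale before a comparison of this kind is meaningful.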

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods*
  • Extreme Heat*
  • Humans
  • India
  • Social Media*