Discovering Periodic Patterns in Historical News

PLoS One. 2016 Nov 8;11(11):e0165736. doi: 10.1371/journal.pone.0165736. eCollection 2016.

Abstract

We address the problem of observing periodic changes in the behaviour of a large population, by analysing the daily contents of newspapers published in the United States and United Kingdom from 1836 to 1922. This is done by analysing the daily time series of the relative frequency of the 25K most frequent words for each country, resulting in the study of 50K time series for 31,755 days. Behaviours that are found to be strongly periodic include seasonal activities, such as hunting and harvesting. A strong connection with natural cycles is found, with a pronounced presence of fruits, vegetables, flowers and game. Periodicities dictated by religious or civil calendars are also detected and show a different wave-form than those provoked by weather. States that can be revealed include the presence of infectious disease, with clear annual peaks for fever, pneumonia and diarrhoea. Overall, 2% of the words are found to be strongly periodic, and the period most frequently found is 365 days. Comparisons between UK and US, and between modern and historical news, reveal how the fundamental cycles of life are shaped by the seasons, but also how this effect has been reduced in modern times.

Publication types

  • Historical Article

MeSH terms

  • Behavior*
  • Fourier Analysis
  • History, 19th Century
  • History, 20th Century
  • Humans
  • Language
  • Newspapers as Topic / history
  • Newspapers as Topic / statistics & numerical data*
  • Periodicity*
  • Seasons
  • United Kingdom
  • United States
  • Weather

Grants and funding

A European Research Council Advanced Grant 339365 "ThinkBIG" granted to NC supported NC, TLW and FD. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.