Visual tools to assess the plausibility of algorithm-identified infectious disease clusters: an application to mumps data from the Netherlands dating from January 2009 to June 2016

Euro Surveill. 2019 Mar;24(12):1800331. doi: 10.2807/1560-7917.ES.2019.24.12.1800331.

Abstract

IntroductionWith growing amounts of data available, identification of clusters of persons linked to each other by transmission of an infectious disease increasingly relies on automated algorithms. We propose cluster finding to be a two-step process: first, possible transmission clusters are identified using a cluster algorithm, second, the plausibility that the identified clusters represent genuine transmission clusters is evaluated.AimTo introduce visual tools to assess automatically identified clusters.MethodsWe developed tools to visualise: (i) clusters found in dimensions of time, geographical location and genetic data; (ii) nested sub-clusters within identified clusters; (iii) intra-cluster pairwise dissimilarities per dimension; (iv) intra-cluster correlation between dimensions. We applied our tools to notified mumps cases in the Netherlands with available disease onset date (January 2009 - June 2016), geographical information (location of residence), and pathogen sequence data (n = 112). We compared identified clusters to clusters reported by the Netherlands Early Warning Committee (NEWC).ResultsWe identified five mumps clusters. Three clusters were considered plausible. One was questionable because, in phylogenetic analysis, genetic sequences related to it segregated in two groups. One was implausible with no smaller nested clusters, high intra-cluster dissimilarities on all dimensions, and low intra-cluster correlation between dimensions. The NEWC reports concurred with our findings: the plausible/questionable clusters corresponded to reported outbreaks; the implausible cluster did not.ConclusionOur tools for assessing automatically identified clusters allow outbreak investigators to rapidly spot plausible transmission clusters for mumps and other human-to-human transmissible diseases. This fast information processing potentially reduces workload.

Keywords: algorithms; cluster identification; mumps; phylogenetic analysis; plausibility assessment; transmission.

MeSH terms

  • Adolescent
  • Adult
  • Algorithms
  • Child
  • Child, Preschool
  • Cluster Analysis
  • Disease Notification / statistics & numerical data*
  • Disease Outbreaks*
  • Humans
  • Middle Aged
  • Molecular Epidemiology / methods*
  • Molecular Sequence Data
  • Mumps / epidemiology
  • Mumps / virology*
  • Mumps virus / genetics*
  • Mumps virus / isolation & purification
  • Netherlands / epidemiology
  • Phylogeny
  • RNA, Viral / genetics*
  • Sequence Analysis, DNA
  • Young Adult

Substances

  • RNA, Viral