sourceR: Classification and source attribution of infectious agents among heterogeneous populations

PLoS Comput Biol. 2017 May 30;13(5):e1005564. doi: 10.1371/journal.pcbi.1005564. eCollection 2017 May.

Abstract

Zoonotic diseases are a major cause of morbidity, and productivity losses in both human and animal populations. Identifying the source of food-borne zoonoses (e.g. an animal reservoir or food product) is crucial for the identification and prioritisation of food safety interventions. For many zoonotic diseases it is difficult to attribute human cases to sources of infection because there is little epidemiological information on the cases. However, microbial strain typing allows zoonotic pathogens to be categorised, and the relative frequencies of the strain types among the sources and in human cases allows inference on the likely source of each infection. We introduce sourceR, an R package for quantitative source attribution, aimed at food-borne diseases. It implements a Bayesian model using strain-typed surveillance data from both human cases and source samples, capable of identifying important sources of infection. The model measures the force of infection from each source, allowing for varying survivability, pathogenicity and virulence of pathogen strains, and varying abilities of the sources to act as vehicles of infection. A Bayesian non-parametric (Dirichlet process) approach is used to cluster pathogen strain types by epidemiological behaviour, avoiding model overfitting and allowing detection of strain types associated with potentially high "virulence". sourceR is demonstrated using Campylobacter jejuni isolate data collected in New Zealand between 2005 and 2008. Chicken from a particular poultry supplier was identified as the major source of campylobacteriosis, which is qualitatively similar to results of previous studies using the same dataset. Additionally, the software identifies a cluster of 9 multilocus sequence types with abnormally high 'virulence' in humans. sourceR enables straightforward attribution of cases of zoonotic infection to putative sources of infection. As sourceR develops, we intend it to become an important and flexible resource for food-borne disease attribution studies.

MeSH terms

  • Animals
  • Bacteria* / classification
  • Bacteria* / pathogenicity
  • Bayes Theorem
  • Campylobacter Infections / epidemiology
  • Campylobacter Infections / microbiology
  • Campylobacter jejuni / classification
  • Campylobacter jejuni / pathogenicity
  • Computational Biology / methods
  • Foodborne Diseases* / epidemiology
  • Foodborne Diseases* / microbiology
  • Humans
  • Models, Biological*
  • New Zealand
  • Software*
  • Zoonoses* / epidemiology
  • Zoonoses* / microbiology

Grants and funding

The authors received no specific funding for this work.