Sample size calculations for pathogen variant surveillance in the presence of biological and systematic biases

Cell Rep Med. 2023 May 16;4(5):101022. doi: 10.1016/j.xcrm.2023.101022. Epub 2023 Apr 26.

Abstract

Tracking the emergence and spread of pathogen variants is an important component of monitoring infectious disease outbreaks. To that end, accurately estimating the number and prevalence of pathogen variants in a population requires carefully designed surveillance programs. However, current approaches to calculating the number of pathogen samples needed for effective surveillance often do not account for the various processes that can bias which infections are detected and which samples are ultimately characterized as a specific variant. In this article, we introduce a framework that accounts for the logistical and epidemiological processes that may bias variant characterization, and we demonstrate how to use this framework (implemented in a publicly available tool) to calculate the number of sequences needed for surveillance. Our framework is designed to be easy to use while also flexible enough to be adapted to various pathogens and surveillance scenarios.
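To illustrate the kind of calculation the abstract describes, the sketch below estimates how many sequences are needed to detect at least one variant infection with a chosen probability, using a standard binomial detection argument. It is not the framework or the tool from the article. The function names and the `detection_ratio` parameter are hypothetical, introduced here only to show how a bias in which infections get characterized could change the required sample size.

```python
import math


def biased_prevalence(true_prevalence, detection_ratio=1.0):
    """Prevalence of the variant among characterized samples.

    detection_ratio is a hypothetical summary of bias: the chance that a
    variant infection is detected and characterized, relative to a
    non-variant infection (1.0 means no bias).
    """
    if not 0 < true_prevalence < 1:
        raise ValueError("true_prevalence must be strictly between 0 and 1")
    if detection_ratio <= 0:
        raise ValueError("detection_ratio must be positive")
    weighted = true_prevalence * detection_ratio
    return weighted / (weighted + (1 - true_prevalence))


def samples_to_detect(true_prevalence, prob_detect=0.95, detection_ratio=1.0):
    """Smallest n such that P(at least one variant among n samples) >= prob_detect.

    Assumes independent samples, so the number of variant samples is binomial:
    1 - (1 - p_obs)^n >= prob_detect.
    """
    if not 0 < prob_detect < 1:
        raise ValueError("prob_detect must be strictly between 0 and 1")
    p_obs = biased_prevalence(true_prevalence, detection_ratio)
    return math.ceil(math.log(1 - prob_detect) / math.log(1 - p_obs))


if __name__ == "__main__":
    # Unbiased sampling of a variant at 1% prevalence.
    print(samples_to_detect(0.01, prob_detect=0.95))
    # Variant infections half as likely to be characterized (assumed value).
    print(samples_to_detect(0.01, prob_detect=0.95, detection_ratio=0.5))
```

Under these assumptions, a detection ratio below 1 lowers the observed prevalence and raises the number of sequences required, while a ratio above 1 has the opposite effect. This is the sense in which ignoring such biases can lead to an undersized or oversized surveillance program.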

Keywords: SARS-CoV-2; infectious disease; pathogen genomics; pathogen variants; sample size calculations; variant surveillance; variants of concern.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bias
  • Disease Outbreaks*
  • Sample Size