Identity-by-descent with uncertainty characterises connectivity of Plasmodium falciparum populations on the Colombian-Pacific coast

PLoS Genet. 2020 Nov 16;16(11):e1009101. doi: 10.1371/journal.pgen.1009101. eCollection 2020 Nov.

Abstract

Characterising connectivity between geographically separated biological populations is a common goal in many fields. Recent approaches to understanding connectivity between malaria parasite populations, with implications for disease control efforts, have used estimates of relatedness based on identity-by-descent (IBD). However, uncertainty around estimated relatedness has not been accounted for. IBD-based relatedness estimates with uncertainty were computed for pairs of monoclonal Plasmodium falciparum samples collected from five cities on the Colombian-Pacific coast where long-term clonal propagation of P. falciparum is frequent. The cities include two official ports, Buenaventura and Tumaco, that are separated geographically but connected by frequent marine traffic. Fractions of highly-related sample pairs (whose classification using a threshold accounts for uncertainty) were greater within cities versus between. However, based on both highly-related fractions and on a threshold-free approach (Wasserstein distances between parasite populations) connectivity between Buenaventura and Tumaco was disproportionally high. Buenaventura-Tumaco connectivity was consistent with transmission events involving parasites from five clonal components (groups of statistically indistinguishable parasites identified under a graph theoretic framework). To conclude, P. falciparum population connectivity on the Colombian-Pacific coast abides by accessibility not isolation-by-distance, potentially implicating marine traffic in malaria transmission with opportunities for targeted intervention. Further investigations are required to test this hypothesis. For the first time in malaria epidemiology (and to our knowledge in ecological and epidemiological studies more generally), we account for uncertainty around estimated relatedness (an important consideration for studies that plan to use genotype versus whole genome sequence data to estimate IBD-based relatedness); we also use threshold-free methods to compare parasite populations and identify clonal components. Threshold-free methods are especially important in analyses of malaria parasites and other recombining organisms with mixed mating systems where thresholds do not have clear interpretation (e.g. due to clonal propagation) and thus undermine the cross-comparison of studies.