Genomic copy number variability at the genus, species and population levels impacts in situ ecological analyses of dinoflagellates and harmful algal blooms

ISME Commun. 2023 Jul 8;3(1):70. doi: 10.1038/s43705-023-00274-0.

Abstract

The application of meta-barcoding, qPCR, and metagenomics to aquatic eukaryotic microbial communities requires knowledge of genomic copy number variability (CNV). CNV may be particularly relevant to functional genes, impacting dosage and expression, yet little is known of the scale and role of CNV in microbial eukaryotes. Here, we quantify CNV of rRNA and a gene involved in Paralytic Shellfish Toxin (PST) synthesis (sxtA4), in 51 strains of 4 Alexandrium (Dinophyceae) species. Genomes varied up to threefold within species and ~7-fold amongst species, with the largest (A. pacificum, 130 ± 1.3 pg cell-1 /~127 Gbp) in the largest size category of any eukaryote. Genomic copy numbers (GCN) of rRNA varied by 6 orders of magnitude amongst Alexandrium (102- 108 copies cell-1) and were significantly related to genome size. Within the population CNV of rRNA was 2 orders of magnitude (105 - 107 cell-1) in 15 isolates from one population, demonstrating that quantitative data based on rRNA genes needs considerable caution in interpretation, even if validated against locally isolated strains. Despite up to 30 years in laboratory culture, rRNA CNV and genome size variability were not correlated with time in culture. Cell volume was only weakly associated with rRNA GCN (20-22% variance explained across dinoflagellates, 4% in Gonyaulacales). GCN of sxtA4 varied from 0-102 copies cell-1, was significantly related to PSTs (ng cell-1), displaying a gene dosage effect modulating PST production. Our data indicate that in dinoflagellates, a major marine eukaryotic group, low-copy functional genes are more reliable and informative targets for quantification of ecological processes than unstable rRNA genes.