Metagenomic and metatranscriptomic data were generated from size-fractionated samples from 11 sites within the Baltic Sea and adjacent marine waters of Kattegat and freshwater Lake Torneträsk in order to investigate the diversity, distribution, and transcriptional activity of virioplankton. Such a transect, spanning a salinity gradient from freshwater to the open sea, facilitated a broad genome-enabled investigation of natural as well as impacted aspects of Baltic Sea viral communities. Taxonomic signatures representative of phages within the widely distributed order Caudovirales were identified with enrichments in lesser-known families such as Podoviridae and Siphoviridae. The distribution of phage reported to infect diverse and ubiquitous heterotrophic bacteria (SAR11 clades) and cyanobacteria (Synechococcus sp.) displayed population-level shifts in diversity. Samples from higher-salinity conditions (>14 practical salinity units [PSU]) had increased abundances of viruses for picoeukaryotes, i.e., Ostreococcus. These data, combined with host diversity estimates, suggest viral modulation of diversity on the whole-community scale, as well as in specific prokaryotic and eukaryotic lineages. RNA libraries revealed single-stranded DNA (ssDNA) and RNA viral populations throughout the Baltic Sea, with ssDNA phage highly represented in Lake Torneträsk. Further, our data suggest relatively high transcriptional activity of fish viruses within diverse families known to have broad host ranges, such as Nodoviridae (RNA), Iridoviridae (DNA), and predicted zoonotic viruses that can cause ecological and economic damage as well as impact human health. IMPORTANCE Inferred virus-host relationships, community structures of ubiquitous ecologically relevant groups, and identification of transcriptionally active populations have been achieved with our Baltic Sea study. Further, these data, highlighting the transcriptional activity of viruses, represent one of the more powerful uses of omics concerning ecosystem health. The use of omics-related data to assess ecosystem health holds great promise for rapid and relatively inexpensive determination of perturbations and risk, explicitly with regard to viral assemblages, as no single marker gene is suitable for widespread taxonomic coverage.
Keywords: marine microbiology; viral ecology; viral metagenomics; viral metatranscriptomics; viral/host inference.