Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 27:7:41498.
doi: 10.1038/srep41498.

Accessing the genomic information of unculturable oceanic picoeukaryotes by combining multiple single cells

Affiliations

Accessing the genomic information of unculturable oceanic picoeukaryotes by combining multiple single cells

Jean-François Mangot et al. Sci Rep. .

Abstract

Pico-sized eukaryotes play key roles in the functioning of marine ecosystems, but we still have a limited knowledge on their ecology and evolution. The MAST-4 lineage is of particular interest, since it is widespread in surface oceans, presents ecotypic differentiation and has defied culturing efforts so far. Single cell genomics (SCG) are promising tools to retrieve genomic information from these uncultured organisms. However, SCG are based on whole genome amplification, which normally introduces amplification biases that limit the amount of genomic data retrieved from a single cell. Here, we increase the recovery of genomic information from two MAST-4 lineages by co-assembling short reads from multiple Single Amplified Genomes (SAGs) belonging to evolutionary closely related cells. We found that complementary genomic information is retrieved from different SAGs, generating co-assembly that features >74% of genome recovery, against about 20% when assembled individually. Even though this approach is not aimed at generating high-quality draft genomes, it allows accessing to the genomic information of microbes that would otherwise remain unreachable. Since most of the picoeukaryotes still remain uncultured, our work serves as a proof-of-concept that can be applied to other taxa in order to extract genomic data and address new ecological and evolutionary questions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. General characteristics of the draft genomes obtained by individual SAGs.
Box plots capture the variation in assembly size (a), number of contigs (b), N50 (c) and GC content (d) among MAST-4A (n = 14) and MAST-4E (n = 9) SAGs.
Figure 2
Figure 2. Genome recovery estimated by CEGMA of SAGs in relation to the sequencing effort.
(a) Genome recovery of the 23 SAGs in relation to their sequencing depth. (b) Genome recovery at different sequencing depths in two selected SAGs (those with the largest genome in each clade). Each point represents the mean recovery after 5 separate subsamplings (at 17%, 33%, 50%, 67%, and 83%) of the total number of reads.
Figure 3
Figure 3. Comparison of tetranucleotide frequencies of SAGs in an ESOM map.
Each contig (2.5–5 kbp in size) is represented by a point placed in the map by relatedness and colored according to their provenance from SAGs of MAST-4A (bluish) or MAST-4E (reddish). Note that the map is continuous from top to bottom and side to side. Large differences in tetranucleotide frequencies (black borders) represent natural divisions between taxonomic groups. Two clusters (a and b) were identified and taxonomically assigned (see text).
Figure 4
Figure 4. Fractions of the co-assembled genomes of MAST-4A and MAST-4E shared among their respective SAGs (from 1 to 14 cells).
The contribution of each SAG was determined through a fragment recruitment analysis of their reads towards the final co-assembly.
Figure 5
Figure 5. Cumulative genome size (a) and genome recovery (b) calculated when increasing the number of SAGs used for co-assembly.
Figure 6
Figure 6. Identification of the 34 CEGs coding for proteins involved in translation, ribosomal structure and biogenesis processes within SAGs and co-assemblies of both lineages.
The presence of CEGs among SAGs and co-assembly (light grey) or solely among SAGs (dark grey) or co-assembly (black) are listed here.
Figure 7
Figure 7. Alignment of the ITS1 (a) and ITS2 (b) regions of individual SAGs and the co-assembly in MAST-4A.
Conserved nucleotides in the helices II and III of the two regions were highlighted according to ITS secondary structure models in MAST-4. Differences against a consensus sequence (not shown) are colored as red (A positions), green (T), blue (C), and yellow (G).

Similar articles

Cited by

References

    1. Giovannoni S. J. & Stingl U. Molecular diversity and ecology of microbial plankton. Nature 437, 343–348 (2005). - PubMed
    1. DeLong E. F. The microbial ocean from genomes to biomes. Nature 459, 200–206 (2009). - PubMed
    1. Falkowski P. G., Fenchel T. & Delong E. F. The microbial engines that drive Earth’s biogeochemical cycles. Science 320, 1034–1039 (2008). - PubMed
    1. Jardillier L., Zubkov M. V., Pearman J. & Scanlan D. J. Significant CO2 fixation by small prymnesiophytes in the subtropical and tropical northeast Atlantic Ocean. ISME J. 4, 1180–1192 (2010). - PubMed
    1. Sherr E. & Sherr B. Understanding roles of microbes in marine pelagic food webs: A brief history In Microbial Ecology of the Oceans (ed. Kirchman D. L.) 27–44 (Wiley-Liss., 2008).

Publication types

LinkOut - more resources