454 pyrosequencing to describe microbial eukaryotic community composition, diversity and relative abundance: a test for marine haptophytes

PLoS One. 2013 Sep 12;8(9):e74371. doi: 10.1371/journal.pone.0074371. eCollection 2013.

Abstract

Next generation sequencing of ribosomal DNA is increasingly used to assess the diversity and structure of microbial communities. Here we test the ability of 454 pyrosequencing to detect the number of species present, and assess the relative abundance in terms of cell numbers and biomass of protists in the phylum Haptophyta. We used a mock community consisting of equal number of cells of 11 haptophyte species and compared targeting DNA and RNA/cDNA, and two different V4 SSU rDNA haptophyte-biased primer pairs. Further, we tested four different bioinformatic filtering methods to reduce errors in the resulting sequence dataset. With sequencing depth of 11000-20000 reads and targeting cDNA with Haptophyta specific primers Hap454 we detected all 11 species. A rarefaction analysis of expected number of species recovered as a function of sampling depth suggested that minimum 1400 reads were required here to recover all species in the mock community. Relative read abundance did not correlate to relative cell numbers. Although the species represented with the largest biomass was also proportionally most abundant among the reads, there was generally a weak correlation between proportional read abundance and proportional biomass of the different species, both with DNA and cDNA as template. The 454 sequencing generated considerable spurious diversity, and more with cDNA than DNA as template. With initial filtering based only on match with barcode and primer we observed 100-fold more operational taxonomic units (OTUs) at 99% similarity than the number of species present in the mock community. Filtering based on quality scores, or denoising with PyroNoise resulted in ten times more OTU99% than the number of species. Denoising with AmpliconNoise reduced the number of OTU99% to match the number of species present in the mock community. Based on our analyses, we propose a strategy to more accurately depict haptophyte diversity using 454 pyrosequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biodiversity*
  • DNA, Complementary
  • DNA, Ribosomal
  • Haptophyta / classification*
  • Haptophyta / genetics*
  • High-Throughput Nucleotide Sequencing* / methods
  • High-Throughput Nucleotide Sequencing* / standards
  • Metagenome*
  • Phylogeny

Substances

  • DNA, Complementary
  • DNA, Ribosomal

Grants and funding

Financial support was given by the Norwegian Research Council (www.forskningsradet.no), through grant 190307 HAPTODIV to ESE and BE, by ASSEMBLE (www.assemblemarine.org) FP7 grant agreement no 227799 (research stay for ESE at the SBR), and through the EU project BioMarKs (www.biomarks.eu) to LB, CdV and BE. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.