Comparative Study
Sci Rep. 2016 Apr 12:6:24175.
doi: 10.1038/srep24175.

Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes

Hsin-Hung Lin et al. Sci Rep. 2016.

Abstract

Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or 'bin' sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/.
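
To make the notion of a genomic signature concrete, the following is a minimal Python sketch that turns a contig into a normalized k-mer frequency vector. It is an illustration only, not MyCC's implementation: the function names, the default k = 4, the merging of each k-mer with its reverse complement, and the normalization are all assumptions, since the abstract does not specify them.

```python
from collections import Counter
from itertools import product

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(kmer):
    """Return the reverse complement of an uppercase DNA string."""
    return kmer.translate(_COMPLEMENT)[::-1]


def canonical(kmer):
    """Merge a k-mer with its reverse complement (strand is unknown for contigs)."""
    return min(kmer, reverse_complement(kmer))


def kmer_signature(sequence, k=4):
    """Hypothetical helper: relative frequency of each canonical k-mer.

    k-mers containing characters other than A, C, G, T (for example N)
    are skipped. The choice k=4 is an assumption for illustration.
    """
    sequence = sequence.upper()
    keys = sorted({canonical("".join(p)) for p in product("ACGT", repeat=k)})
    counts = Counter()
    for i in range(len(sequence) - k + 1):
        kmer = sequence[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[canonical(kmer)] += 1
    total = sum(counts.values())
    return {key: (counts[key] / total if total else 0.0) for key in keys}


signature = kmer_signature("ACGTACGT")
print(len(signature))  # number of canonical 4-mers
```

Contigs from the same genome tend to have similar vectors of this kind, which is what allows composition-based binning to group them before marker genes or coverage are taken into account.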

Figures

Figure 1. An overview of the MyCC workflow and visualization.
(a) A schematic workflow for MyCC. (b) A plot of Barnes-Hut-SNE-based dimensionality reduction. (c) Automated clustering by affinity propagation. (d) Corrected clusters based on marker genes. These plots were produced by MyCC when binning Sharon’s dataset (“MyCC.py carrol.fasta -a My.depth.txt -keep”).
Figure 2. Explanations for outputs of MyCC.
(a) Visualization of metagenomic binning. (b) A summary file produced by MyCC, reporting genome size (WholeGenome), N50, number of contigs (NoOfCtg) and number of marker genes (Cogs) for each bin. (c) The binned sequences of each cluster are output in FASTA format. (d) Gold-standard binning assignments available at MetaBAT’s website. (e) Binning performance evaluation based on the gold-standard assignments. MyCC was applied to bin a mock dataset of 25 genomes (“MyCC.py assembly.fa -a My.depth.txt”).
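
The per-bin statistics named in Figure 2b can be reproduced from contig lengths alone. The sketch below computes N50 using its standard definition (the length of the contig at which the running total of lengths, sorted from longest to shortest, first reaches half of the total assembly size). The function names and the dictionary layout are hypothetical; only the field names WholeGenome, N50 and NoOfCtg come from the caption, and the marker-gene count (Cogs) is omitted because it cannot be derived from lengths.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (0 if empty)."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0


def bin_summary(lengths):
    """Hypothetical per-bin summary keyed by the field names in Figure 2b."""
    return {
        "WholeGenome": sum(lengths),  # total bases in the bin
        "N50": n50(lengths),
        "NoOfCtg": len(lengths),      # number of contigs in the bin
    }


print(bin_summary([2, 3, 4, 5, 6, 7, 8, 9, 10]))
```

For the nine example lengths the total is 54, and the running sum of the three longest contigs (10 + 9 + 8 = 27) is the first to reach half of that, so the N50 is 8.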