IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth
- PMID: 22495754
- DOI: 10.1093/bioinformatics/bts174
Abstract
Motivation: Next-generation sequencing allows us to sequence reads from a microbial environment using single-cell or metagenomic sequencing technologies. However, both technologies suffer from the problem that the sequencing depths of different regions of a genome, or of genomes from different species, are highly uneven. Most existing genome assemblers assume that sequencing depths are even, and therefore fail to construct correct long contigs.
Results: We introduce the IDBA-UD algorithm, which is based on the de Bruijn graph approach, for assembling reads from single-cell or metagenomic sequencing technologies with uneven sequencing depths. Several non-trivial techniques are employed to tackle these problems. Instead of using a single simple threshold, we use multiple depth-relative thresholds to remove erroneous k-mers in both low-depth and high-depth regions. Local assembly with paired-end information is used to solve the branch problem of low-depth short repeat regions. To speed up the process, an error correction step corrects reads from high-depth regions that can be aligned to high-confidence contigs. A comparison of IDBA-UD with existing assemblers (Velvet, Velvet-SC, SOAPdenovo and Meta-IDBA) on different datasets shows that IDBA-UD can reconstruct longer contigs with higher accuracy.
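The contrast between a single global threshold and a depth-relative threshold can be illustrated with a small sketch. This is not the IDBA-UD implementation: the function names, the `ratio` and `floor` parameters, and the toy counts are all assumptions made for illustration, and the sketch works on individual k-mers with their direct de Bruijn neighbours while ignoring reverse complements and the iteration over k.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in the reads (forward strand only)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def neighbors(kmer, counts):
    """Return observed k-mers that overlap `kmer` by k-1 bases on either side."""
    found = []
    for base in "ACGT":
        for cand in (kmer[1:] + base, base + kmer[:-1]):
            if cand != kmer and cand in counts:
                found.append(cand)
    return found


def filter_global(counts, min_depth):
    """Keep k-mers whose depth reaches one fixed threshold."""
    return {km for km, c in counts.items() if c >= min_depth}


def filter_relative(counts, ratio=0.2, floor=1):
    """Keep k-mers whose depth is at least `ratio` times the highest
    depth among their neighbours (hypothetical rule, for illustration)."""
    kept = set()
    for km, c in counts.items():
        local = max((counts[n] for n in neighbors(km, counts)), default=c)
        if c >= max(floor, ratio * local):
            kept.add(km)
    return kept


# Toy data (assumed): a high-depth path ACG -> CGT -> GTA at depth 100,
# an erroneous branch ACG -> CGA seen 3 times, and a genuine low-depth
# path TTG -> TGC -> GCC at depth 2.
toy_counts = {
    "ACG": 100, "CGT": 100, "GTA": 100,
    "CGA": 3,
    "TTG": 2, "TGC": 2, "GCC": 2,
}

kept_global = filter_global(toy_counts, min_depth=3)
kept_relative = filter_relative(toy_counts, ratio=0.2)
```

With these toy counts, the global threshold of 3 discards the genuine low-depth k-mers while keeping the erroneous `CGA`, whereas the relative rule removes `CGA` (3 is far below 20% of its neighbour's depth of 100) and keeps the low-depth path.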
Availability: The IDBA-UD toolkit is available at our website http://www.cs.hku.hk/~alse/idba_ud
