Graph accordance of next-generation sequence assemblies
- PMID: 22025481
- PMCID: PMC3244760
- DOI: 10.1093/bioinformatics/btr588
Graph accordance of next-generation sequence assemblies
Abstract
Motivation: No individual assembly algorithm addresses all the known limitations of assembling short-length sequences. Overall reduced sequence contig length is the major problem that challenges the usage of these assemblies. We describe an algorithm to take advantages of different assembly algorithms or sequencing platforms to improve the quality of next-generation sequence (NGS) assemblies.
Results: The algorithm is implemented as a graph accordance assembly (GAA) program. The algorithm constructs an accordance graph to capture the mapping information between the target and query assemblies. Based on the accordance graph, the contigs or scaffolds of the target assembly can be extended, merged or bridged together. Extra constraints, including gap sizes, mate pairs, scaffold order and orientation, are explored to enforce those accordance operations in the correct context. We applied GAA to various chicken NGS assemblies and the results demonstrate improved contiguity statistics and higher genome and gene coverage.
Availability: GAA is implemented in OO perl and is available here: http://sourceforge.net/projects/gaa-wugi/.
Contact: lye@genome.wustl.edu
Figures
Similar articles
-
GRASS: a generic algorithm for scaffolding next-generation sequencing assemblies.Bioinformatics. 2012 Jun 1;28(11):1429-37. doi: 10.1093/bioinformatics/bts175. Epub 2012 Apr 6. Bioinformatics. 2012. PMID: 22492642
-
SOPRA: Scaffolding algorithm for paired reads via statistical optimization.BMC Bioinformatics. 2010 Jun 24;11:345. doi: 10.1186/1471-2105-11-345. BMC Bioinformatics. 2010. PMID: 20576136 Free PMC article.
-
GAM-NGS: genomic assemblies merger for next generation sequencing.BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-14-S7-S6. Epub 2013 Apr 22. BMC Bioinformatics. 2013. PMID: 23815503 Free PMC article.
-
Trypanosoma cruzi Genome Assemblies: Challenges and Milestones of Assembling a Highly Repetitive and Complex Genome.Methods Mol Biol. 2019;1955:1-22. doi: 10.1007/978-1-4939-9148-8_1. Methods Mol Biol. 2019. PMID: 30868515 Review.
-
Next-generation sequencing technologies and fragment assembly algorithms.Methods Mol Biol. 2012;855:155-74. doi: 10.1007/978-1-61779-582-4_5. Methods Mol Biol. 2012. PMID: 22407708 Review.
Cited by
-
TransBorrow: genome-guided transcriptome assembly by borrowing assemblies from different assemblers.Genome Res. 2020 Aug;30(8):1181-1190. doi: 10.1101/gr.257766.119. Epub 2020 Aug 17. Genome Res. 2020. PMID: 32817072 Free PMC article.
-
MAC: Merging Assemblies by Using Adjacency Algebraic Model and Classification.Front Genet. 2020 Jan 31;10:1396. doi: 10.3389/fgene.2019.01396. eCollection 2019. Front Genet. 2020. PMID: 32082361 Free PMC article.
-
Draft Genome Sequence of Bacillus marisflavi CK-NBRI-03, Isolated from Agricultural Soil.Microbiol Resour Announc. 2020 Feb 13;9(7):e00044-20. doi: 10.1128/MRA.00044-20. Microbiol Resour Announc. 2020. PMID: 32054702 Free PMC article.
-
Extensive chromosomal rearrangements and rapid evolution of novel effector superfamilies contribute to host adaptation and speciation in the basal ascomycetous fungi.Mol Plant Pathol. 2020 Mar;21(3):330-348. doi: 10.1111/mpp.12899. Epub 2020 Jan 8. Mol Plant Pathol. 2020. PMID: 31916390 Free PMC article.
-
Draft Genome Sequence of a Potential Plant Growth-Promoting Rhizobacterium, Pseudomonas sp. Strain CK-NBRI-02.Microbiol Resour Announc. 2019 Oct 24;8(43):e01113-19. doi: 10.1128/MRA.01113-19. Microbiol Resour Announc. 2019. PMID: 31649082 Free PMC article.
References
-
- Casagrande A., et al. IEEE International Conference on Bioinformatics and Biomedicine (BIBM). Washington, DC: 2009. GAM: genomics assemblies merger: a graph based method to integrate different assemblies; pp. 321β326.
-
- Consortium I.C.G.S. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004;432:695β716. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous
