The genome polishing tool POLCA makes fast and accurate corrections in genome assemblies
- PMID: 32589667
- PMCID: PMC7347232
- DOI: 10.1371/journal.pcbi.1007981
The genome polishing tool POLCA makes fast and accurate corrections in genome assemblies
Abstract
The introduction of third-generation DNA sequencing technologies in recent years has allowed scientists to generate dramatically longer sequence reads, which when used in whole-genome sequencing projects have yielded better repeat resolution and far more contiguous genome assemblies. While the promise of better contiguity has held true, the relatively high error rate of long reads, averaging 8-15%, has made it challenging to generate a highly accurate final sequence. Current long-read sequencing technologies display a tendency toward systematic errors, in particular in homopolymer regions, which present additional challenges. A cost-effective strategy to generate highly contiguous assemblies with a very low overall error rate is to combine long reads with low-cost short-read data, which currently have an error rate below 0.5%. This hybrid strategy can be pursued either by incorporating the short-read data into the early phase of assembly, during the read correction step, or by using short reads to "polish" the consensus built from long reads. In this report, we present the assembly polishing tool POLCA (POLishing by Calling Alternatives) and compare its performance with two other popular polishing programs, Pilon and Racon. We show that on simulated data POLCA is more accurate than Pilon, and comparable in accuracy to Racon. On real data, all three programs show similar performance, but POLCA is consistently much faster than either of the other polishing programs.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
Similar articles
-
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11. Genomics. 2021. PMID: 33716184
-
Assembly of chloroplast genomes with long- and short-read data: a comparison of approaches using Eucalyptus pauciflora as a test case.BMC Genomics. 2018 Dec 29;19(1):977. doi: 10.1186/s12864-018-5348-8. BMC Genomics. 2018. PMID: 30594129 Free PMC article.
-
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing.BMC Genomics. 2019 Jan 9;20(1):23. doi: 10.1186/s12864-018-5381-7. BMC Genomics. 2019. PMID: 30626323 Free PMC article.
-
Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions.Brief Bioinform. 2019 Jul 19;20(4):1542-1559. doi: 10.1093/bib/bby017. Brief Bioinform. 2019. PMID: 29617724 Free PMC article. Review.
-
Chromosome-level hybrid de novo genome assemblies as an attainable option for nonmodel insects.Mol Ecol Resour. 2020 Sep;20(5):1277-1293. doi: 10.1111/1755-0998.13176. Epub 2020 Jun 7. Mol Ecol Resour. 2020. PMID: 32329220 Review.
Cited by
-
An improved chromosome-level genome assembly of perennial ryegrass (Lolium perenne L.).GigaByte. 2024 Mar 6;2024:gigabyte112. doi: 10.46471/gigabyte.112. eCollection 2024. GigaByte. 2024. PMID: 38496214 Free PMC article.
-
Metagenome-assembled genomes of three Hepatoplasmataceae provide insights into isopod-mollicute symbiosis.Access Microbiol. 2024 Feb 20;6(2):000592.v3. doi: 10.1099/acmi.0.000592.v3. eCollection 2024. Access Microbiol. 2024. PMID: 38482369 Free PMC article.
-
Apis mellifera filamentous virus from a honey bee gut microbiome survey in Hungary.Sci Rep. 2024 Mar 9;14(1):5803. doi: 10.1038/s41598-024-56320-x. Sci Rep. 2024. PMID: 38461199 Free PMC article.
-
Constructing telomere-to-telomere diploid genome by polishing haploid nanopore-based assembly.Nat Methods. 2024 Mar 8. doi: 10.1038/s41592-023-02141-1. Online ahead of print. Nat Methods. 2024. PMID: 38459383
-
A 2000-Year-Old Bacillus stercoris Strain Sheds Light on the Evolution of Cyclic Antimicrobial Lipopeptide Synthesis.Microorganisms. 2024 Feb 6;12(2):338. doi: 10.3390/microorganisms12020338. Microorganisms. 2024. PMID: 38399742 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
