gVolante for standardizing completeness assessment of genome and transcriptome assemblies
- PMID: 29036533
- PMCID: PMC5870689
- DOI: 10.1093/bioinformatics/btx445
Abstract
Motivation: As comprehensive sequence information, such as whole genomes and transcriptomes, has become more accessible, the demand for assessing its quality has grown. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they evaluate only one aspect of assembly quality. In contrast, analyzing the coverage of pre-selected reference protein-coding genes provides essential content-based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user-friendly interface to serve as a uniform environment for assembly completeness assessment.
Results: Here, we introduce a new web server, gVolante, which provides an online tool for (i) on-demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre-computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based not only on the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control in multiple aspects. Using gVolante, one can compare the quality of multiple versions of one's own assemblies (obtained through program choice and parameter tweaking, for example) and evaluate them against the scores of public resources found in the database section.
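As an illustration of the length-based metric mentioned above, N50 is commonly defined as the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal sketch under that common definition; the function name `n50` and the example lengths are hypothetical, and it is not taken from gVolante's own implementation.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the common definition: walk the sequences from longest to
    shortest and return the length at which the running sum first
    reaches at least half of the total length.
    """
    lengths = list(lengths)
    if not lengths:
        raise ValueError("at least one sequence length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths (in bp), for illustration only.
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(example_lengths))
```

Because N50 depends only on the length distribution, two assemblies with the same N50 can differ widely in gene content, which is why the abstract pairs it with coverage of reference genes.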
Availability and implementation: gVolante is freely available at https://gvolante.riken.jp/.
Contact: shigehiro.kuraku@riken.jp.
© The Author 2017. Published by Oxford University Press.
