Bioinformatics. 2017 Nov 15;33(22):3635-3637.
doi: 10.1093/bioinformatics/btx445.

gVolante for standardizing completeness assessment of genome and transcriptome assemblies

Osamu Nishimura et al. Bioinformatics.

Abstract

Motivation: As comprehensive sequence information, such as whole genomes and transcriptomes, has become more accessible, the demand for assessing its quality has multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they evaluate only one aspect of assembly quality. In contrast, analyzing the coverage of pre-selected reference protein-coding genes provides an essential content-based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user-friendly interface that could serve as a uniform environment for assembly completeness assessment.

Results: Here, we introduce a new web server, gVolante, which provides an online tool for (i) on-demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre-computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based not only on the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control from multiple aspects. Using gVolante, one can compare the quality of multiple versions of one's own assemblies (obtained through program choice and parameter tweaking, for example) and evaluate them against the scores of public resources found in the database section.
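
To make the length-based metric concrete, the following is a minimal sketch of N50 under its common definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name `n50` and the example scaffold lengths are hypothetical and chosen for illustration only; this is not gVolante's own implementation, which the abstract does not describe.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumption: the common definition, i.e. the largest length L such that
    sequences of length >= L together cover at least half the total length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point issues with total / 2.
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths (in bp) for illustration only.
scaffold_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(scaffold_lengths))  # 70: 80 + 70 = 150 covers half of the 300 total
```

Because N50 depends only on sequence lengths, two assemblies with the same N50 can differ widely in gene content, which is why the abstract pairs it with reference-gene coverage.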

Availability and implementation: gVolante is freely available at https://gvolante.riken.jp/.

Contact: shigehiro.kuraku@riken.jp.


Figures

Fig. 1
Functions of gVolante. The web server provides two functions, ‘Analysis’ in the upper row and ‘Database’ in the lower row. Using gVolante, one can compare the quality of original assemblies and evaluate them against the scores of public resources found in the database section, supporting content-based decision-making for more comprehensive downstream analyses.
