BUSCO: Assessing Genome Assembly and Annotation Completeness

Methods Mol Biol. 2019;1962:227-245. doi: 10.1007/978-1-4939-9173-0_14.

Abstract

Genomics drives the current progress in molecular biology, generating unprecedented volumes of data. The scientific value of these sequences depends on the ability to evaluate their completeness using a biologically meaningful approach. Here, we describe the use of the BUSCO tool suite to assess the completeness of genomes, gene sets, and transcriptomes, using their gene content as a complementary method to common technical metrics. The chapter introduces the concept of universal single-copy genes, which underlies the BUSCO methodology, covers the basic requirements to set up the tool, and provides guidelines to properly design the analyses, run the assessments, and interpret and utilize the results.

Keywords: BUSCO; Gene content; Genome completeness; Orthologs; Phylogenomics; Quality assessment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic
  • Gene Dosage
  • Genome
  • Genomics / methods*
  • Internet
  • Markov Chains
  • Molecular Sequence Annotation / methods*
  • Software*
  • Transcriptome