GET_HOMOLOGUES, a versatile software package for scalable and robust microbial pangenome analysis
- PMID: 24096415
- PMCID: PMC3837814
- DOI: 10.1128/AEM.02411-13
Abstract
GET_HOMOLOGUES is an open-source software package that builds on popular orthology-calling approaches, making highly customizable and detailed pangenome analyses of microorganisms accessible to nonbioinformaticians. It can cluster homologous gene families using the bidirectional best-hit, COGtriangles, or OrthoMCL clustering algorithms. Clustering stringency can be adjusted by scanning the domain composition of proteins using the HMMER3 package, by imposing desired pairwise alignment coverage cutoffs, or by selecting only syntenic genes. The resulting homologous gene families can be made even more robust by computing consensus clusters from those generated by any combination of the clustering algorithms and filtering criteria. Auxiliary scripts simplify the construction, interrogation, and graphical display of core genome and pangenome sets. Exponential and binomial mixture models can be fitted to the data to estimate theoretical core genome and pangenome sizes, and high-quality graphics can be generated. Furthermore, pangenome trees can be easily computed and basic comparative genomics performed to identify lineage-specific genes or gene family expansions. The software is designed to take advantage of modern multiprocessor personal computers as well as computer clusters to parallelize time-consuming tasks. To demonstrate some of these capabilities, we survey a set of 50 Streptococcus genomes annotated in the Orthologous Matrix (OMA) browser as a benchmark case. The package can be downloaded at http://www.eead.csic.es/compbio/soft/gethoms.php and http://maya.ccg.unam.mx/soft/gethoms.php.
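The consensus-cluster and core/pangenome ideas in the abstract can be illustrated with a minimal Python sketch. It is not the package's implementation (GET_HOMOLOGUES is distributed as its own scripts), and it rests on assumptions made only for illustration: gene identifiers are written as "genome|gene", "consensus" is read as keeping only clusters that every clustering run reports with identical membership, and the names `consensus_clusters`, `core_and_pan`, and the toy data are hypothetical.

```python
from typing import FrozenSet, Iterable, List, Set, Tuple

# A cluster is a set of gene IDs; the "genome|gene" ID format is an assumption.
Cluster = FrozenSet[str]


def genome_of(gene_id: str) -> str:
    """Return the genome label encoded before the first '|'."""
    return gene_id.split("|", 1)[0]


def consensus_clusters(runs: List[Set[Cluster]]) -> Set[Cluster]:
    """Keep clusters found with identical membership in every run."""
    if not runs:
        return set()
    return set.intersection(*runs)


def core_and_pan(
    clusters: Iterable[Cluster], genomes: Iterable[str]
) -> Tuple[Set[Cluster], Set[Cluster]]:
    """Core = clusters with a member in every genome; pan = all clusters."""
    all_genomes = set(genomes)
    pan = set(clusters)
    core = {c for c in pan if {genome_of(g) for g in c} >= all_genomes}
    return core, pan


# Toy output of two hypothetical clustering runs over genomes A, B, and C.
run_bdbh = {
    frozenset({"A|g1", "B|g1", "C|g1"}),
    frozenset({"A|g2", "B|g2"}),
    frozenset({"C|g9"}),
}
run_omcl = {
    frozenset({"A|g1", "B|g1", "C|g1"}),
    frozenset({"A|g2", "B|g2"}),
    frozenset({"A|g3", "C|g3"}),
}

consensus = consensus_clusters([run_bdbh, run_omcl])
core, pan = core_and_pan(consensus, ["A", "B", "C"])
print(len(consensus), len(core), len(pan))
```

In this toy case the two runs agree on two clusters, of which only the one with members in A, B, and C counts as core. Requiring identical membership is the strictest possible reading of consensus; a looser overlap rule would retain more clusters at the cost of robustness.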
