Assessing the performance of single-copy genes for recovering robust phylogenies
- PMID: 18709599
- DOI: 10.1080/10635150802306527
Assessing the performance of single-copy genes for recovering robust phylogenies
Abstract
Phylogenies involving nonmodel species are based on a few genes, mostly chosen following historical or practical criteria. Because gene trees are sometimes incongruent with species trees, the resulting phylogenies may not accurately reflect the evolutionary relationships among species. The increase in availability of genome sequences now provides large numbers of genes that could be used for building phylogenies. However, for practical reasons only a few genes can be sequenced for a wide range of species. Here we asked whether we can identify a few genes, among the single-copy genes common to most fungal genomes, that are sufficient for recovering accurate and well-supported phylogenies. Fungi represent a model group for phylogenomics because many complete fungal genomes are available. An automated procedure was developed to extract single-copy orthologous genes from complete fungal genomes using a Markov Clustering Algorithm (Tribe-MCL). Using 21 complete, publicly available fungal genomes with reliable protein predictions, 246 single-copy orthologous gene clusters were identified. We inferred the maximum likelihood trees using the individual orthologous sequences and constructed a reference tree from concatenated protein alignments. The topologies of the individual gene trees were compared to that of the reference tree using three different methods. The performance of individual genes in recovering the reference tree was highly variable. Gene size and the number of variable sites were highly correlated and significantly affected the performance of the genes, but the average substitution rate did not. Two genes recovered exactly the same topology as the reference tree, and when concatenated provided high bootstrap values. The genes typically used for fungal phylogenies did not perform well, which suggests that current fungal phylogenies based on these genes may not accurately reflect the evolutionary relationships among species. Analyses on subsets of species showed that the phylogenetic performance did not seem to depend strongly on the sample. We expect that the best-performing genes identified here will be very useful for phylogenetic studies of fungi, at least at a large taxonomic scale. Furthermore, we compare the method developed here for finding genes for building robust phylogenies with previous ones and we advocate that our method could be applied to other groups of organisms when more complete genomes are available.
Similar articles
-
FUNYBASE: a FUNgal phYlogenomic dataBASE.BMC Bioinformatics. 2008 Oct 27;9:456. doi: 10.1186/1471-2105-9-456. BMC Bioinformatics. 2008. PMID: 18954438 Free PMC article.
-
Inferring species phylogenies from multiple genes: concatenated sequence tree versus consensus gene tree.J Exp Zool B Mol Dev Evol. 2005 Jan 15;304(1):64-74. doi: 10.1002/jez.b.21026. J Exp Zool B Mol Dev Evol. 2005. PMID: 15593277
-
Phylogenomics reveal a robust fungal tree of life.FEMS Yeast Res. 2006 Dec;6(8):1213-20. doi: 10.1111/j.1567-1364.2006.00119.x. FEMS Yeast Res. 2006. PMID: 17156018
-
The search for the fungal tree of life.Trends Microbiol. 2009 Nov;17(11):488-97. doi: 10.1016/j.tim.2009.08.001. Epub 2009 Sep 24. Trends Microbiol. 2009. PMID: 19782570 Review.
-
New perspectives in hemiascomycetous yeast taxonomy.C R Biol. 2011 Aug-Sep;334(8-9):590-8. doi: 10.1016/j.crvi.2011.05.006. Epub 2011 Jun 30. C R Biol. 2011. PMID: 21819939 Review.
Cited by
-
Phylogeny of Penicillium and the segregation of Trichocomaceae into three families.Stud Mycol. 2011 Nov 15;70(1):1-51. doi: 10.3114/sim.2011.70.01. Stud Mycol. 2011. PMID: 22308045 Free PMC article.
-
Toward a DNA taxonomy of Alpine Rhithrogena (Ephemeroptera: Heptageniidae) using a mixed Yule-coalescent analysis of mitochondrial and nuclear DNA.PLoS One. 2011;6(5):e19728. doi: 10.1371/journal.pone.0019728. Epub 2011 May 17. PLoS One. 2011. PMID: 21611178 Free PMC article.
-
Differential gene retention as an evolutionary mechanism to generate biodiversity and adaptation in yeasts.Sci Rep. 2015 Jun 25;5:11571. doi: 10.1038/srep11571. Sci Rep. 2015. PMID: 26108467 Free PMC article.
-
Finding single copy genes out of sequenced genomes for multilocus phylogenetics in non-model fungi.PLoS One. 2011 Apr 13;6(4):e18803. doi: 10.1371/journal.pone.0018803. PLoS One. 2011. PMID: 21533204 Free PMC article.
-
Identification, characterization, and utilization of single copy genes in 29 angiosperm genomes.BMC Genomics. 2014 Jun 21;15(1):504. doi: 10.1186/1471-2164-15-504. BMC Genomics. 2014. PMID: 24950957 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
