Phylogenetic Analyses of Sites in Different Protein Structural Environments Result in Distinct Placements of the Metazoan Root
- PMID: 32231097
- PMCID: PMC7235752
- DOI: 10.3390/biology9040064
Abstract
Phylogenomics, the use of large datasets to examine phylogeny, has revolutionized the study of evolutionary relationships. However, genome-scale data have not been able to resolve all relationships in the tree of life; this could reflect, at least in part, the poor fit of the models used to analyze heterogeneous datasets. Some of the heterogeneity may reflect the different patterns of selection on proteins based on their structures. To test that hypothesis, we developed a pipeline to divide phylogenomic protein datasets into subsets based on secondary structure and relative solvent accessibility. We then tested whether amino acids in different structural environments had distinct signals for the topology of the deepest branches in the metazoan tree. We focused on a dataset that appeared to have a mixture of signals, and we found that the most striking difference in phylogenetic signal reflected relative solvent accessibility. Analyses of exposed sites (residues located on the surface of proteins) yielded a tree that placed ctenophores sister to all other animals, whereas sites buried inside proteins yielded a tree with a sponge+ctenophore clade. These differences in phylogenetic signal were not ameliorated when we conducted analyses using a set of maximum-likelihood profile mixture models. These models are very similar to the Bayesian CAT model, which has been used in many analyses of deep metazoan phylogeny. In contrast, analyses conducted after recoding amino acids to limit the impact of deviations from compositional stationarity increased the congruence in the estimates of phylogeny for exposed and buried sites; after amino acid recoding, trees estimated using the exposed and the buried sites both supported placement of ctenophores sister to all other animals.
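As a rough illustration of the kind of partitioning described above, the following minimal sketch splits alignment columns into exposed and buried subsets from per-site relative solvent accessibility (RSA) values. It is not the authors' pipeline: the 0.25 cutoff, the function names, and the dictionary-based alignment format are all assumptions made for the example, and the abstract does not state how RSA was assigned to alignment columns.

```python
from typing import Dict, List

# Hypothetical cutoff: the abstract does not state the threshold the authors used.
RSA_EXPOSED_CUTOFF = 0.25


def partition_sites(rsa_per_site: List[float],
                    cutoff: float = RSA_EXPOSED_CUTOFF) -> Dict[str, List[int]]:
    """Assign each alignment column index to 'exposed' or 'buried' by its RSA."""
    parts: Dict[str, List[int]] = {"exposed": [], "buried": []}
    for idx, rsa in enumerate(rsa_per_site):
        label = "exposed" if rsa >= cutoff else "buried"
        parts[label].append(idx)
    return parts


def subset_alignment(alignment: Dict[str, str],
                     columns: List[int]) -> Dict[str, str]:
    """Return a new alignment that keeps only the requested columns."""
    return {taxon: "".join(seq[i] for i in columns)
            for taxon, seq in alignment.items()}


# Toy data: two taxa, four aligned sites, one RSA value per site.
alignment = {"taxon_A": "MKLV", "taxon_B": "MRLI"}
rsa = [0.8, 0.1, 0.3, 0.0]

parts = partition_sites(rsa)
exposed = subset_alignment(alignment, parts["exposed"])
buried = subset_alignment(alignment, parts["buried"])
```

Each subset could then be analyzed separately with the same tree-inference settings, so that any difference in topology is attributable to the structural class of the sites.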
Although the central conclusion of our analyses is that sites in different structural environments yield distinct trees when analyzed using models of protein evolution, our amino acid recoding analyses also have implications for metazoan evolution. Specifically, our results add to the evidence that ctenophores are the sister group of all other animals and they further suggest that the placozoa+cnidaria clade found in some other studies deserves more attention. Taken as a whole, these results provide striking evidence that it is necessary to achieve a better understanding of the constraints due to protein structure to improve phylogenetic estimation.
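To make the idea of amino acid recoding concrete, the sketch below maps each residue to one of six reduced states using the widely used Dayhoff grouping. This is an assumption chosen for illustration: the abstract does not say which recoding scheme or schemes were applied, and the function name and the handling of gaps and unknown characters are hypothetical.

```python
# Dayhoff 6-state grouping (an illustrative choice, not necessarily the
# scheme used in the study). Each group collapses to a single state.
DAYHOFF6_GROUPS = {
    "0": "AGPST",
    "1": "DENQ",
    "2": "HKR",
    "3": "ILMV",
    "4": "FWY",
    "5": "C",
}

# Invert the grouping into a residue -> state lookup table.
RECODE = {aa: state for state, group in DAYHOFF6_GROUPS.items() for aa in group}


def recode_sequence(seq: str) -> str:
    """Recode an aligned protein sequence into reduced states.

    Gap ('-') and missing ('?') characters pass through unchanged; any other
    character not in the table (for example 'X') becomes '?'.
    """
    return "".join(
        ch if ch in "-?" else RECODE.get(ch.upper(), "?")
        for ch in seq
    )


recoded = recode_sequence("MKC-X")
```

Collapsing residues with similar properties into one state discards substitutions within a group, which is the mechanism by which recoding is expected to reduce sensitivity to compositional differences among taxa.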
Keywords: Ctenophora; Porifera; RY coding; heteropecilly; metazoan phylogeny; non-stationary models; protein structure; relative solvent accessibility.
Conflict of interest statement
The authors declare no conflict of interest.
Similar articles
- Extracting phylogenetic signal and accounting for bias in whole-genome data sets supports the Ctenophora as sister to remaining Metazoa. BMC Genomics. 2015 Nov 23;16:987. doi: 10.1186/s12864-015-2146-4. PMID: 26596625. Free PMC article.
- Error, signal, and the placement of Ctenophora sister to all other animals. Proc Natl Acad Sci U S A. 2015 May 5;112(18):5773-8. doi: 10.1073/pnas.1503453112. Epub 2015 Apr 20. PMID: 25902535. Free PMC article.
- A Large and Consistent Phylogenomic Dataset Supports Sponges as the Sister Group to All Other Animals. Curr Biol. 2017 Apr 3;27(7):958-967. doi: 10.1016/j.cub.2017.02.031. Epub 2017 Mar 16. PMID: 28318975.
- Employing Phylogenomics to Resolve the Relationships among Cnidarians, Ctenophores, Sponges, Placozoans, and Bilaterians. Integr Comp Biol. 2015 Dec;55(6):1084-95. doi: 10.1093/icb/icv037. Epub 2015 May 13. PMID: 25972566. Review.
- The ctenophore lineage is older than sponges? That cannot be right! Or can it? J Exp Biol. 2015 Feb 15;218(Pt 4):592-7. doi: 10.1242/jeb.111872. PMID: 25696822. Review.
Cited by
- Substitution Models of Protein Evolution with Selection on Enzymatic Activity. Mol Biol Evol. 2024 Feb 1;41(2):msae026. doi: 10.1093/molbev/msae026. PMID: 38314876. Free PMC article.
- Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea. Proc Biol Sci. 2023 Jul 12;290(2002):20230988. doi: 10.1098/rspb.2023.0988. Epub 2023 Jul 12. PMID: 37434530. Free PMC article.
- The Structure of Evolutionary Model Space for Proteins across the Tree of Life. Biology (Basel). 2023 Feb 10;12(2):282. doi: 10.3390/biology12020282. PMID: 36829559. Free PMC article.
- Highly Dynamic Gene Family Evolution Suggests Changing Roles for PON Genes Within Metazoa. Genome Biol Evol. 2023 Feb 3;15(2):evad011. doi: 10.1093/gbe/evad011. PMID: 36718542. Free PMC article.
- Exploring Conflicts in Whole Genome Phylogenetics: A Case Study Within Manakins (Aves: Pipridae). Syst Biol. 2023 May 19;72(1):161-178. doi: 10.1093/sysbio/syac062. PMID: 36130303. Free PMC article.
