Origins of coevolution between residues distant in protein 3D structures
- PMID: 28784799
- PMCID: PMC5576787
- DOI: 10.1073/pnas.1702664114
Origins of coevolution between residues distant in protein 3D structures
Abstract
Residue pairs that directly coevolve in protein families are generally close in protein 3D structures. Here we study the exceptions to this general trend-directly coevolving residue pairs that are distant in protein structures-to determine the origins of evolutionary pressure on spatially distant residues and to understand the sources of error in contact-based structure prediction. Over a set of 4,000 protein families, we find that 25% of directly coevolving residue pairs are separated by more than 5 Å in protein structures and 3% by more than 15 Å. The majority (91%) of directly coevolving residue pairs in the 5-15 Å range are found to be in contact in at least one homologous structure-these exceptions arise from structural variation in the family in the region containing the residues. Thirty-five percent of the exceptions greater than 15 Å are at homo-oligomeric interfaces, 19% arise from family structural variation, and 27% are in repeat proteins likely reflecting alignment errors. Of the remaining long-range exceptions (<1% of the total number of coupled pairs), many can be attributed to close interactions in an oligomeric state. Overall, the results suggest that directly coevolving residue pairs not in repeat proteins are spatially proximal in at least one biologically relevant protein conformation within the family; we find little evidence for direct coupling between residues at spatially separated allosteric and functional sites or for increased direct coupling between residue pairs on putative allosteric pathways connecting them.
Keywords: homo-oligomeric contacts; protein coevolution; structural variation.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Similar articles
-
Chasing long-range evolutionary couplings in the AlphaFold era.Biopolymers. 2023 Mar;114(3):e23530. doi: 10.1002/bip.23530. Epub 2023 Feb 8. Biopolymers. 2023. PMID: 36752285 Free PMC article.
-
CoeViz: a web-based tool for coevolution analysis of protein residues.BMC Bioinformatics. 2016 Mar 8;17:119. doi: 10.1186/s12859-016-0975-z. BMC Bioinformatics. 2016. PMID: 26956673 Free PMC article.
-
Sequence coevolution between RNA and protein characterized by mutual information between residue triplets.PLoS One. 2012;7(1):e30022. doi: 10.1371/journal.pone.0030022. Epub 2012 Jan 18. PLoS One. 2012. PMID: 22279560 Free PMC article.
-
Inter-residue, inter-protein and inter-family coevolution: bridging the scales.Curr Opin Struct Biol. 2018 Jun;50:26-32. doi: 10.1016/j.sbi.2017.10.014. Epub 2017 Nov 5. Curr Opin Struct Biol. 2018. PMID: 29101847 Free PMC article. Review.
-
Correlated substitution analysis and the prediction of amino acid structural contacts.Brief Bioinform. 2008 Jan;9(1):46-56. doi: 10.1093/bib/bbm052. Epub 2007 Nov 13. Brief Bioinform. 2008. PMID: 18000015 Review.
Cited by
-
Analysis of 1276 Haplotype-Resolved Genomes Allows Characterization of Cis- and Trans-Abundant Genes.Methods Mol Biol. 2023;2590:237-272. doi: 10.1007/978-1-0716-2819-5_15. Methods Mol Biol. 2023. PMID: 36335503
-
Inference and reconstruction of the heimdallarchaeial ancestry of eukaryotes.Nature. 2023 Jun;618(7967):992-999. doi: 10.1038/s41586-023-06186-2. Epub 2023 Jun 14. Nature. 2023. PMID: 37316666 Free PMC article.
-
Structural motifs in protein cores and at protein-protein interfaces are different.Protein Sci. 2021 Feb;30(2):381-390. doi: 10.1002/pro.3996. Epub 2020 Nov 20. Protein Sci. 2021. PMID: 33166001 Free PMC article.
-
Genomic Signatures of Mitonuclear Coevolution in Mammals.Mol Biol Evol. 2022 Nov 3;39(11):msac233. doi: 10.1093/molbev/msac233. Mol Biol Evol. 2022. PMID: 36288802 Free PMC article.
-
Chasing long-range evolutionary couplings in the AlphaFold era.Biopolymers. 2023 Mar;114(3):e23530. doi: 10.1002/bip.23530. Epub 2023 Feb 8. Biopolymers. 2023. PMID: 36752285 Free PMC article.
References
-
- Jones DT, Buchan DWA, Cozzetto D, Pontil M. PSICOV: Precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments. Bioinformatics. 2012;28:184–190. - PubMed
-
- Ekeberg M, Lövkvist C, Lan Y, Weigt M, Aurell E. Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models. Phys Rev E Stat Nonlin Soft Matter Phys. 2013;87:012707. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
