Low complexity regions in the proteins of prokaryotes perform important functional roles and are highly conserved
- PMID: 31504783
- PMCID: PMC6821194
- DOI: 10.1093/nar/gkz730
Low complexity regions in the proteins of prokaryotes perform important functional roles and are highly conserved
Abstract
We provide the first high-throughput analysis of the properties and functional role of Low Complexity Regions (LCRs) in more than 1500 prokaryotic and phage proteomes. We observe that, contrary to a widespread belief based on older and sparse data, LCRs actually have a significant, persistent and highly conserved presence and role in many and diverse prokaryotes. Their specific amino acid content is linked to proteins with certain molecular functions, such as the binding of RNA, DNA, metal-ions and polysaccharides. In addition, LCRs have been repeatedly identified in very ancient, and usually highly expressed proteins of the translation machinery. At last, based on the amino acid content enriched in certain categories, we have developed a neural network web server to identify LCRs and accurately predict whether they can bind nucleic acids, metal-ions or are involved in chaperone functions. An evaluation of the tool showed that it is highly accurate for eukaryotic proteins as well.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Similar articles
-
Why do eukaryotic proteins contain more intrinsically disordered regions?PLoS Comput Biol. 2019 Jul 22;15(7):e1007186. doi: 10.1371/journal.pcbi.1007186. eCollection 2019 Jul. PLoS Comput Biol. 2019. PMID: 31329574 Free PMC article.
-
Novel conserved domains in proteins with predicted roles in eukaryotic cell-cycle regulation, decapping and RNA stability.BMC Genomics. 2004 Jul 16;5(1):45. doi: 10.1186/1471-2164-5-45. BMC Genomics. 2004. PMID: 15257761 Free PMC article.
-
PlaToLoCo: the first web meta-server for visualization and annotation of low complexity regions in proteins.Nucleic Acids Res. 2020 Jul 2;48(W1):W77-W84. doi: 10.1093/nar/gkaa339. Nucleic Acids Res. 2020. PMID: 32421769 Free PMC article.
-
Evolution of synapse complexity and diversity.Annu Rev Neurosci. 2012;35:111-31. doi: 10.1146/annurev-neuro-062111-150433. Annu Rev Neurosci. 2012. PMID: 22715880 Review.
-
Structure-function insights into prokaryotic and eukaryotic translation initiation.Curr Opin Struct Biol. 2009 Jun;19(3):300-9. doi: 10.1016/j.sbi.2009.04.010. Epub 2009 Jun 1. Curr Opin Struct Biol. 2009. PMID: 19493673 Review.
Cited by
-
Are the Head and Tail Domains of Intermediate Filaments Really Unstructured Regions?Genes (Basel). 2024 May 16;15(5):633. doi: 10.3390/genes15050633. Genes (Basel). 2024. PMID: 38790262 Free PMC article. Review.
-
Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.PLoS Comput Biol. 2024 May 15;20(5):e1011372. doi: 10.1371/journal.pcbi.1011372. eCollection 2024 May. PLoS Comput Biol. 2024. PMID: 38748749 Free PMC article.
-
Bioinformatics tools for the sequence complexity estimates.Biophys Rev. 2023 Sep 15;15(5):1367-1378. doi: 10.1007/s12551-023-01140-y. eCollection 2023 Oct. Biophys Rev. 2023. PMID: 37974990 Free PMC article. Review.
-
MIF-like domain containing protein orchestrates cellular differentiation and virulence in the fungal pathogen Magnaporthe oryzae.iScience. 2023 Aug 6;26(9):107565. doi: 10.1016/j.isci.2023.107565. eCollection 2023 Sep 15. iScience. 2023. PMID: 37664630 Free PMC article.
-
Convergent behavior of extended stalk regions from staphylococcal surface proteins with widely divergent sequence patterns.Protein Sci. 2023 Aug;32(8):e4707. doi: 10.1002/pro.4707. Protein Sci. 2023. PMID: 37334491 Free PMC article.
References
-
- Wootton J.C. Non-globular domains in protein sequences: automated segmentation using complexity measures. Comput. Chem. 1994; 18:269–285. - PubMed
-
- Wootton J.C., Drummond M.H.. The Q-linker: a class of interdomain sequences found in bacterial multidomain regulatory proteins. Protein. Eng. 1989; 2:535–543. - PubMed
-
- Huntley M.A., Golding G.B.. Simple sequences are rare in the Protein Data Bank. Proteins. 2002; 48:134–140. - PubMed
-
- Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J.. Basic local alignment search tool. J. Mol. Biol. 1990; 215:403–410. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
